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Foreword 


The only portrait in the Council Room of the 
Royal Society in London is one of Joseph Banks, 
whose powerful presence still seems to dominate 
the room. Banks was President for an astonishing 
42 years, from 1778 to 1820. These were, more¬ 
over, turbulent years: the French Revolution, the 
Napoleonic Wars, and subsequent post-war unrest 
in Britain (including the Peterloo massacre). Less 
commonly appreciated is the fact that these were 
also turbulent years within this old-established 
and influential scientific society. Up until the later 
years of the eighteenth century, 'science' essen¬ 
tially meant mathematical and physical science. 
Banks was a botanist (he studied briefly under Lin¬ 
naeus), and influential members of the Royal Soci¬ 
ety deprecated him as being not so much a scientist 
as a gardener (albeit a politically well-connected 
one). 

Underlying all this is the astonishing fact that 
the first stirrings of systematic interest in catalogu¬ 
ing the diversity of other living things with which 
humans share the planet came a full century' after 
the foundation of the Royal Society in 1660 and 
Newton's providing a fundamental understanding 
of the laws of motion, along with the inverse square 
law of gravitational attraction. The corresponding 
date for Linnaeus' binary codification in De Rerum 
Naturae (which recognized a global total of around 
9000 species of plants and animals) is 1758. In 
many ways the legacy of this century-long lag still 
lingers. 

Today the global total number of distinct species 
of eukaryotes (never mind the viruses and bac¬ 
teria, where there are essential differences in the 
definition of 'species') that have been named 
and recorded is probably around 1.6-1.7 million, 
with roughly 15,000 being added each year (May 


2007). Part of the uncertainty derives from lack 
of synoptic databases for many groups, which 
compounds difficulties in resolving synonyms— 
the same species independently found, and then 
named and recorded, in two or more different 
places by different people. Today's true total of dis¬ 
tinct eukaryotic species is estimated to lie in the 
range of 5-10 million, with figures as low as 3 
million or as high as 100 million being defensible. 
These lamentable uncertainties result partly from 
what a management consultant would call ineffi¬ 
ciencies in the distribution of the relevant work¬ 
force. Although the taxonomy of taxonomists is 
itself poorly documented, rough estimates suggest 
it is approximately evenly divided among verte¬ 
brates, plants, and invertebrates (Gaston & May 
1992), but there are around 10 plant species, and at 
least 100 invertebrate species (possibly more than 
1000), for each vertebrate species. The labour force 
is even more inefficiently divided if one consid¬ 
ers the research literature on conservation biology. 
Here an analysis of the 2700 papers published in 
Conservation Biology and Biological Conservation 
between 1979 and 1998 showed 69% devoted to 
vertebrates, 20% to plants, and 11% to all inverte¬ 
brates (with half of these being lepidoptera, which 
appear to have the status of honorary birds) (Clark 
& May 2002). And conservation action, as indi¬ 
cated, for example, by the World Wildlife Fund's 
Annual Report, is almost wholly devoted to charis¬ 
matic megavertebrates. This is understandable in 
view of public attitudes, but arguably unfortunate 
in terms of preserving ecosystem functioning; the 
argument that protecting vertebrate biodiversity 
will more-or-less automatically also preserve inver¬ 
tebrate biodiversity does not survive close exami¬ 
nation (Prendergast et al. 1993). 
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xv j FOREWORD 


The present volume is, of course, focused 
on advances—along with remaining areas of 
ignorance—in our understanding of patterns of bio¬ 
logical diversity: what is there and how best to 
measure and characterize it. The broad range of 
relevant topics covered in the book is clearly and 
succinctly set out in Magurran and McGill's intro¬ 
ductory Chapter 1, and so I make no effort to 
duplicate it in this Foreword, but I cannot resist re¬ 
emphasizing how many aspects of this quest for 
understanding began much more recently than Lin¬ 
naeus. In particular, note that the first published 
study of the important issue of species' relative 
abundance (SRA) is only 100 years old (Raunkaier 
1909). 

Quite a few of the chapters deal with vari¬ 
ous "indices", aiming to reduce the often com¬ 
plex details of the relative abundance of species 
in a defined area or ecosystem to a single num¬ 
ber (or, rarely, a couple of numbers). For pur¬ 
poses of quick assessments or comparisons, this 
is understandable, and often sensible. Flowever— 
and here 1 am riding one of my hobbyhorses—1 
believe that whenever possible it is better to use the 
full distribution of SRA. Condensing this informa¬ 
tion, often the fruit of much labour, into a couple 
of indices such as mean and variance (or some¬ 
thing equivalent) may seem convenient, but the 
loss of detail can easily result in important fea¬ 
tures of the relevant community assembly being 
overlooked. Particularly pernicious, in my view, 
is the so-called Shannon-Wiener diversity index, 
H = — p.lnp, (with p, the number of individ¬ 
uals in the ith species, as a fraction of the total). 
Whilst entrancingly tempting in its resonance with 
information theory, by taking logarithms to get the 
Shannon-Wiener index one is greatly compressing 
the data (remember the difference between 100 and 
1000000 is less than a factor of 10 once natural log¬ 
arithms are taken), and thereby significantly weak¬ 
ening the discriminatory power of the index. If 
one must use simpler indices, at least avoid those 
based on logarithmic transformations of the data 
(May 1975). 

Another thing to keep in mind when con¬ 
sidering patterns of abundance in relation to 
threats to biodiversity is that rarity does not 


necessarily imply endangerment. Nor does great 
abundance necessarily imply robustness: witness 
the passenger pigeon. This point was first empha¬ 
sized, with telling examples, in Rabinowitz's clas¬ 
sic paper 'Seven Forms of Rarity'. It is worth 
emphasizing again (Rabinowitz 1981; Rabinowitz 
et al 1986). 

The pressures currently being inflicted on nat¬ 
ural communities of plants and animals are huge, 
and increasing. Since Darwin published the The 
Origin of Species roughly 150 years ago, human 
numbers have increased sevenfold, and the energy 
use per person has increased by a similar fac¬ 
tor, resulting in a 50-fold increase in our over¬ 
all impact on our planet's ecosystems. Vitousek 
et al.'s (1986) estimate that humanity takes to itself, 
directly or indirectly, roughly 40% of terrestrial net 
primary productivity has recently been validated 
by satellite images of the land area modified by 
us (Sachs 2008). Even more extraord inary, of all 
the atmospheric nitrogen fixed in 2008, 55% came 
from the Harber-Bosch chemical process rather than 
the natural biogeochemical processes which cre¬ 
ated, and which struggle to maintain, the biosphere 
(Sachs 2008). 

All this has consequences for the services that 
natural ecosystems deliver to us. Such ecosystem 
services are not counted in conventional measures 
of gross domestic product (GDP), although rough 
estimates of their value puts it comparable with, or 
greater than, the global GDP of conventional eco¬ 
nomics (Constanza et al. 1997). The recent Millen¬ 
nium Ecosystem Assessment (MEA) classifies these 
various ecosystem services under 24 broad head¬ 
ings (Millennium Ecosystem Assessment 2005; see 
also May 2007). Of these services, 15 are assessed as 
currently degrading, four improving (these mainly 
relate to food production), and we do not under¬ 
stand the remaining five well enough to evaluate 
them. 

Not least among the areas where we need deeper 
understanding of the origins and maintenance of 
biological diversity is how might humans alter 
habitats and ecosystems to provide for their needs, 
but do so subject to constraints which preserve 
both particular individual species and key ele¬ 
ments of ecosystems. This will not be easy. Many 
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purists will flinch at the prospect. The alternative, 
however, would seem too often to be a mosaic 
of degraded habitat, increasingly threatening ded¬ 
icated reserves, with all the tensions thus entailed. 
Terborgh's (1983) Five Neu> World Primates is a pio¬ 
neering work in this arena. 


In summary, the present book could not be more 
timely. It gives a comprehensive and authoritative 
account both of what we do know, and what we do 
not. As such, it provides a toolkit of useful infor¬ 
mation and techniques, and also a guide to areas 
requiring further work. 

Robert M. May 
Zoology Department 
Oxford University 
Oxford 0X1 3PS, UK 



CHAPTER 1 


Challenges and opportunities 
in the measurement and assessment 
of biological diversity 

Anne E. Magurran and Brian J. McGill 


'When we look at the plants and bushes clothing an 
entangled bank, we are tempted to attribute their pro¬ 
portional numbers and kinds to what we call chance. 
But how false a view is this! Every one has heard that 
when an American forest is cut down, a very different 
vegetation springs up; but it has been observed that 
ancient Indian ruins in the Southern United States, 
which must formerly have been cleared of trees, now 
display the same beautiful diversity and proportion of 
kinds as in the surrounding virgin forests.' 

-Darwin, Origin of Species (1859) 

1.1 Introduction 

By the 1800s naturalists had begun to document a 
fact that was surely apparent to the earliest humans: 
that species vary markedly in how common they 
are. Audubon, for example, noted that bird species 
in North America differ in abundance by up to 
seven orders of magnitude (McGill 2006; McGill 
et al. 2007). Darwin's (1859) reflections on the obser¬ 
vation that some taxa are abundant yet many are 
rare, and that there are geographical differences in 
the abundances and types of species, shaped his 
thinking about natural selection. In drawing atten¬ 
tion to the 'proportional numbers and kinds' of 
species in nature, however, Darwin also identified 
questions that continue to challenge ecologists a 
century and a half later. 

Humans have a strong intuitive sense of the dis¬ 
tinction between high diversity and low diversity 
assemblages, as encapsulated, for example, by the 
contrast between a tropical rain forest and a mono¬ 
culture planted for timber. Moreover, the inequality 


of species abundances that Darwin reported is 
so pervasive as to have acquired the status of 
an 'ecological law' (McGill et al. 2007). However, 
quantifying the biological diversity of a community, 
assessing differences in this diversity over space or 
time, and doing so in ways that are useful to those 
who seek explanations for the natural patterns as 
well as to the managers and policy makers charged 
with the sustainable use of wild nature, is by no 
means as simple as it first appears. The reason for 
this difficulty is that biological diversity is a mul¬ 
tifaceted concept that can be defined and docu¬ 
mented in different ways. Being clear about exactly 
what we mean by biological diversity (or biodiver¬ 
sity) is the first step towards measuring it. Even 
then the user can be confronted by a myriad mea¬ 
sures, some of which will do a better job than others. 
The goal of this book is to guide readers through 
the entangled bank of biodiversity measures and 
provide an up-to-date and accessible account of this 
important and rapidly expanding field. 

1.2 State of the field 

2010 marks the United Nations International Year of 
Biodiversity (UN IYB), an initiative that highlights 
the growing need to provide informative and robust 
assessments of biological diversity. The objectives 
of the IYB are to promote the protection of biodi¬ 
versity and encourage organizations, institutions, 
companies, and individuals to take direct action 
to reduce the constant loss of biological diver¬ 
sity worldwide. However, 2010 is also the target 
date set by the 2002 World Summit on Sustainable 
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Development 'for significant reduction in the cur¬ 
rent rate of loss of biological diversity'. It is by now 
generally accepted that the 2010 target will not be 
met, and of course there are many reasons for this, 
one of which is the ongoing debate about how best 
to measure biological diversity. 

The measurement of biological diversity has res¬ 
onances with the parable of the blind men and 
the elephant where one felt the trunk, one an ear, 
one a tusk and another a leg, only to find that 
they disagreed about the nature of the creature they 
had encountered. Quantifying biological diversity 
is in some ways like trying to measure an elephant 
(Nanney 2004). An elephant might be described in 
terms of its morphology or behaviour or neural cir¬ 
cuitry, or DNA. Gene expression might be tracked 
through time or compared amongst individuals. 
The social interactions in different herds might also 
be explored. All of these approaches are valid, but 
clearly different. The investigator needs to spec¬ 
ify which aspect of the elephant is being assessed, 
and then make the case that the types of measures 
adopted are employed correctly and are suitable 
for the purpose for which they are bring used. 
In ‘the same way that there is no single method 
that entirely captures the essence of elephantness, 
so there is no one metric that perfectly quantifies 
biological diversity. A large, and growing, num¬ 
ber of diversity measures have been developed 
(Southwood & Henderson 2000; Royal Society 2003; 
Magurran 2004). Southwood (1978) (p. 421) wryly 
noted the 'explosive speciation' of diversity indices 
and the habit of inventors of new measures to con¬ 
demn their predecessors. Biodiversity assessment 
embraces not just the classical measures of richness, 
evenness, and species abundance distributions but 
also evaluations of occurrence, range size, and vul¬ 
nerability as well as functional traits and phyloge¬ 
netic diversity. It is clear that there can be no single 
'best buy' index that will suit all needs. Instead, 
clusters of measures have been developed to cap¬ 
ture a certain aspect of biodiversity, with some 
of these performing this task better than others. 
Although it may seem like a disadvantage to have 
to choose amongst a range of metrics, the ability to 
examine biological diversity in different ways not 
only helps ecologists gain a "deeper understanding 
of how ecosystems function, but also sheds light on 


issues of practical concern such as the link between 
diversity and ecosystem services. 

The quantification of biological diversity proba¬ 
bly began with Darwin, who in 1855 recorded all 
the plants in the meadow. Great Pucklands, next 
to his family home at Downe. His list ran to 142 
species. It was over 50 years before the first account 
of the relative abundance of species was pub¬ 
lished by Raunkaier in 1909. Other early, yet piv¬ 
otal, papers by Motomura (1932) and Fisher (Fisher 
et al. 1943) laid the foundations for the investiga¬ 
tion of species abundance distributions. Additional 
insightful and significant contributions (including 
those by Preston (1948), MacArthur (1960), May 
(1975), Pielou (1975), Taylor et al. (1976), and Sug- 
ihara (1980) followed. These provide the frame¬ 
work within which biodiversity is measured, and 
still repay careful reading. Two developments that 
occurred towards the end of the twentieth century, 
however, shifted the measurement and assessment 
of biological diversity to a new gear. The first of 
these was the recognition that biological diversity is 
a crucial global resource as well as one that is being 
lost at an accelerating rate, the second the advances 
in computing power that have fostered substan¬ 
tial improvements in statistical and modelling 
techniques. 

1.3 What is in this book 

Although vastly improved analytical and comput¬ 
ing resources are of tremendous benefit to ecolo¬ 
gists, the rapid development of new methods, the 
inconsistent and sometimes confusing application 
of old ones, and the lack of agreement in the liter¬ 
ature about the best approach means that users can 
be bewildered. This book provides an up-to-date 
account of the methods used to measure and assess 
biological diversity and places particular empha¬ 
sis on the practical issues involved in measure¬ 
ment. It extends the discussion in Magurran (2004) 
and includes many new developments as well as a 
re-examination of familiar approaches. Our focus is 
measurement of the variety, abundance, and geo¬ 
graphical occurrence of taxa, reflecting the sense 
in which Darwin used the term 'diversity'. The 
authors have wide experience of the measurement 
of biological diversity and have contributed to 
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recent advances. As befits a vibrant field, our con¬ 
tributors do not invariably agree on which metrics 
and approaches are the best ones, although there is 
broad consensus about the essentials of biodiversity 
assessment. The various chapters set out the issues 
that investigators must consider and explain the 
advantages and disadvantages of different meth¬ 
ods, thus helping the reader to reach a sensible con¬ 
clusion about the best course of action in their own 
particular study system. 

The book is primarily aimed at those who need 
to measure and assess biological diversity. We have 
in mind upper-level undergraduates engaged in 
research projects, graduate students, and postdoc¬ 
toral researchers, as well as environmental man¬ 
agers and conservation biologists. Although not 
designed as a theoretical ecology text, we nonethe¬ 
less hope that the issues raised in the book will 
spark the interest of modellers and theoreticians. In 
each chapter the author(s) describe the state of the 
field, discuss recommendations and future direc¬ 
tions, and end with key points. 

The book begins with an overview of the basic 
measurement issues involved in biodiversity stud¬ 
ies. As Scott Bonar, Jeffrey Fehmi, and Norman 
Mercado-Silva make plain, survey design and 
methodology play a crucial role in the success— 
or otherwise—of biodiversity investigations. These 
are well-known concerns, although they often get 
much less attention than they deserve; Chapter 2 
sets out the points that must be addressed before 
an investigation can even begin. In contrast Stephen 
Buckland, Angelika Studeny, Anne Magurran, and 
Stuart Newson highlight a problem that is poten¬ 
tially of great importance but one that has, until 
now, been almost entirely overlooked. Most diver¬ 
sity statistics proceed on the assumption that indi¬ 
viduals and species have been collected at random 
from the community of interest and most investiga¬ 
tors use these statistics irrespective of whether this 
assumption is fulfilled. In practice individuals and 
species often vary considerably in how easy they 
are to detect. Buckland et al. show how detectabil¬ 
ity can affect the conclusions drawn from diversity 
statistics, and provide advice on how to deal with 
the issues raised, although it is clear that assessing 
individual and species variation in detectability is 
easier for some taxa than for others. 


The next section deals with the approaches that 
are sometimes described as measures of 'species 
diversity'. Although the term 'biodiversity' can 
mean many things, and is sometimes used in a 
generic way (EASAC 2009), most scientists, man¬ 
agers, and policy makers identify species richness 
as a central component. In many ways species 
richness—the number of species in a given local¬ 
ity or assemblage—is the iconic measure of bio¬ 
logical diversity. It is used to identify biodiver¬ 
sity hotspots and plays an important role in con¬ 
servation planning. Specfes richness also accords 
well with our intuitive sense of biological diversity. 
However, despite its wide appeal and apparent sim¬ 
plicity, accurate estimates of species richness can be 
remarkably difficult to achieve. This is not just at 
the global level, where it is still unclear, to within 
at least an order of magnitude, how many species 
inhabit the planet, but also at local levels and 
even for taxonomically well-characterized organ¬ 
isms. The reason for this difficulty lies in the obser¬ 
vations of Darwin and Audobon—because most 
species are rare, an increase in sampling effort will 
almost always lead to an increase in richness. Fortu¬ 
nately there has been intense interest in recent years 
in providing solutions to this problem; Nicholas 
Gotelli and Robert Colwell provide a clear account 
of the approaches that can be used to make fair 
comparisons amongst sites where sampling effort 
differs (which can happen even if the investigator 
thinks that it is the same) and examine the new 
generation of non-parametric estimators that can be 
used to deduce the minimum number of species 
present in an assemblage. 

Although species richness is widely used as a 
measure of biological diversity, investigators often 
want to find a means of quantifying Darwin's 'pro¬ 
portional numbers and kinds' in a single statis¬ 
tic. As noted at the beginning of this chapter all 
communities consist of species that vary in their 
abundance. However, communities (or localities) 
can also differ from one another in terms of their 
proportions of species, that is, in how 'even' their 
species abundance distributions are. The degree 
of evenness can shed light on the processes that 
shape a community's structure or provide a gauge 
of the impacts on it. There are a swathe of measures 
that populate the ground between species richness 
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and evenness. These include familiar choices such 
as the Shannon, Simpson, and Margalef indexes 
and a host of less well-known ones. Brian Maurer 
and Brian McGill explore this territory in Chap¬ 
ter 5 and provides guidance on the use of diversity 
statistics. 

Researchers may also want to go beyond an 
assessment of the diversity of a single site to say 
something about the larger picture. This might 
involve drawing conclusions about the distribution 
of biological diversity amongst a series of com¬ 
munities or examining sifts in diversity across a 
gradient. Whittaker (1960) first made the distinc¬ 
tion between a diversity—the diversity of a site 
or habitat—and (3 diversity—the difference in com¬ 
positional diversity between two or more sites, a 
and (3 diversity combine to give y diversity—the 
diversity of the landscape. Until recently measures 
of compositional similarity and (3 diversity received 
less attention than methods of assessing a diver¬ 
sity. There has, however, been a marked increase 
of interest in this topic during the last decade and 
in Chapter 6 Lou Jost, Anne Chao, and Robin Chaz- 
don provide an overview of this rapidly developing 
field. 

The idea that the number of species recorded 
increases if a site is surveyed over an extended 
period has deep roots in ecology and was first for¬ 
malized by Preston, who argued that species area 
and species time curves are equivalent. The urgent 
need—highlighted by the UN's IYB—to quantify 
changes in ecological communities and to deter¬ 
mine whether the rate of biodiversity loss is accel¬ 
erating or slowing has led to a flurry of research in 
this field. Anne Magurran reviews methods of eval¬ 
uating temporal turnover in Chapter 7, but argues 
that there is still much to be learnt about baseline 
changes in ecological communities. 

A single index of biodiversity limits the amount 
of information that can be conveyed. An alternative 
is to examine the structure of a community in terms 
of its species abundance distribution or spatial pat¬ 
terning in more detail. This, is the subject matter 
of the next section of the book. In Chapter 8 Anne 
Magurran and Peter Henderson explore the eco¬ 
logical context of commonness and rarity, and dis¬ 
cuss methods that can be usecffo identify and track 
common and rare species. A species abundance 


distribution is the formal description of the abun¬ 
dance of the recorded species in a community, as 
well as a tool that is widely used to compare com¬ 
munities and to test hypotheses about community 
structure (McGill et al. 2007). Brian McGill exam¬ 
ines species abundance distributions in detail in 
Chapter 9. After summarizing current methods of 
studying species abundance distributions, he sug¬ 
gests a new way to visually display species abun¬ 
dance distributions that does not suffer from some 
of the limitations of earlier graphical methods. He 
also presents a detailed analysis of which quanti¬ 
tative measures behave well under small sample 
sizes and which measures are truly independent of 
each other. Based on this he offers some guidelines 
on which measures to use under several scenarios. 
One of the main challenges with regard to species 
abundance distributions is how best to fit them. 
In Chapter 10 Sean Connolly and Maria Dornelas 
critically evaluate different approaches for fitting 
species abundance models to data and examine 
methods of quantifying goodness of fit. As they 
conclude, model selection statistics are a promising 
development and have advantages over traditional 
approaches such as graphical inspection and com¬ 
parative analyses of goodness-of-fit statistics. 

Whereas some ecologists focus on the distribu¬ 
tions of species abundances others are concerned 
with the spatial placement of species. As Kevin Gas¬ 
ton and Fangliang He argue in Chapter 11, one of 
the most fundamental units of biodiversity is the 
presence or absence of a species in a given site, 
such as habitat patch, an island, or a grid square 
on a map. They show that simple species x sites 
(r x c) matrices, in which the presence/absence 
of different species (in rows r) is given for a set 
of different sites (in columns c ), lie at the heart of 
many biodiversity studies and are linked to pat¬ 
terns such as species-area relationships, nested¬ 
ness, and gradients in (3 diversity. Gaston and He 
explore occupancy-area relationships, occupancy- 
abundance relationships, and species-occupancy 
distributions, and note that a key challenge is 
to estimate species abundance from occupancy. 
Brian McGill develops the spatial perspective in 
Chapter 12 by examining the implications of the 
spatial structure of biodiversity for management 
and conservation. New developments in spatial 
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statistics are explored and applied to questions such 
as 'is there variation in abundance across space?' 
and 'are there interactions across space?'. The analy¬ 
sis of spatial structure is an example of an area of 
biodiversity in which new analytical techniques are 
proving particularly important. 

Historically most investigators have used species 
(or in some cases morphospecies) as their bio¬ 
diversity currency. However, some ecologists, for 
example Pielou (1975), were ahead of their time 
in recognizing that it is possible to increase the 
information content of biodiversity assessment by 
including taxonomic or phylogenetic information. 
This view has been reinforced by growing aware¬ 
ness of the need to conserve biodiversity func¬ 
tion and by rapid advances in methodology. Recent 
developments in this field are discussed in the next 
section of the book, on alternative measures of 
diversity. The section begins with a contribution by 
Evan Weiher on trait diversity. He argues in Chap¬ 
ter 13 that there are four conceptual aspects to trait 
diversity: the occupation of trait space, functional 
evenness, functional divergence, and the density of 
species packing. A rapidly growing set of indices is 
available and guidance is provided on the choices. 
As always the approach will depend on one's goal. 
For example, a multivariate method may be most 
appropriate if the goal is to assess the amount of 
trait diversity in a community. 

Phylogenetic diversity is another area in which 
large numbers of metrics are being developed. 
Much of the impetus behind this rapid expan¬ 
sion is the growing awareness that these mea¬ 
sures have an important role to play in both con¬ 
servation biology and community ecology. Mark 
Vellend, William Cornwell, Karen Magnuson-Ford, 
and Arne Mooers provide a conceptual overview 
of phylogenetic diversity in Chapter 14 and report 
the results of simulation analyses that examine 
artificial communities constructed using a range 
of assumptions about phylogeny structure and 
assemblage composition. Vellend and coauthors 
also explore the qualitative and quantitative rela¬ 
tionships among metrics and advise on choosing 
metrics for different purposes. They point out, for 
instance, that the sensitivity of phylogenetic diver¬ 
sity metrics depends to a large extent on the shape 
of the phylogenetic tree. Improved phylogenetic 


information is, of course, underpinned by the revo¬ 
lution in molecular ecology. This means that rather 
than counting species—or even evaluating traits 
and phylogenies—investigators can now directly 
evaluate the diversity of genes. Melanie Culver, 
Robert Fitak, and Hans-Wemer Herrmann discuss 
these opportunities in Chapter 15. As they point 
out, genetic diversity can be measured using a 
variety of molecular genetic markers, each with 
a different evolutionary rate. Markers have differ¬ 
ent resolving power, so the choice of marker will 
depend on the biodiversity question, and the level 
of resolution needed to address that. 

As will already be clear, the methods presented in 
this book are relevant to a wide range of theoretical 
and applied questions. The final section, on appli¬ 
cations, presents a set of topics and case studies in 
which diversity measurement and assessment plays 
a fundamental role. Lise 0vreas and Thomas Curtis 
argue in Chapter 16 that microbial diversity is the 
outermost frontier in the exploration of diversity. 
They note that the traditional species concept is 
useless in microbial ecology and focus instead on 
evolutionary relationships inferred from changes 
in molecules, primarily rRNA. 0vreas and Curtis 
note the importance of sampling and look ahead 
to the advent of novel massive parallel sequencing 
as a means of providing a comprehensive examina¬ 
tion of microbial communities. Although advances 
in microbial ecology are driven by technology, the 
assessment of microbial diversity is also dependent 
on statistical techniques, such as the non-parametric 
richness estimators devised by Anne Chao. 

Chapter 17 turns to another problem of consid¬ 
erable contemporary concern, that is disturbance 
and its impact on ecosystems. Maria Dornelas, Can- 
dan Soykan, and Karl Inne Ugland examine the 
use of biodiversity metrics to measure the effects 
of disturbance and observe that the sheer num¬ 
ber of methods available can make it difficult 
to select an appropriate measure. This difficulty 
is heightened by the lack of rigorous compara¬ 
tive studies exploring the merits of the different 
approaches. They argue that the most appropriate 
method will vary from case to case depending on 
the nature of the disturbance but recommend sev¬ 
eral options, including the use of multiple metrics. 
The empirical cumulative distribution function is 
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one new method that appears to have considerable 
merit. Traits-based metrics are also potentially valu¬ 
able. Dornelas et al. conclude by emphasizing that 
more theoretical work is needed to understand the 
effects of disturbances on biodiversity metrics other 
than species richness. 

There are few landscapes in the world have 
not been modified by humans. Steven Chown and 
Melodie McGeoeh ask, in Chapter 18, how the goals 
of biodiversity measurement are shaped by this 
transformation. Biodiversity assessments of mod¬ 
ified landscapes such as agroecosystems draw on 
the techniques discussed in this book and, as else¬ 
where, the approach adopted in any particular case 
will depend on the question being posed. How¬ 
ever, it may be informative to explicitly address 
management issues, for example when assessing 
the biodiversity of a landscape that is a tapestry 
of modified habitats such as hedgerows and mono¬ 
cultures, or contains fields under a variety of agri¬ 
cultural systems, for example conventional and 
organic farming. In another context, conservation 
managers often seek to reduce impacts and reverse 
transformation. Chown and McGeoch argue that 
r x c matrices are a very useful addition to the 
toolkit of methods used to assess transformed land¬ 
scapes as they make it possible to explore patterns 
across different levels of biodiversity, such as genes, 
traits, and species. 

Quantifying the loss of biodiversity is a task that 
increasingly preoccupies ecologists and conserva¬ 
tion biologists. It is also a central goal in palaeontol¬ 
ogy and investigators who test ideas about extinc¬ 
tion in the context of the fossil record use methods 
that are similar to those employed by conservation 
biologists. In Chapter 19 Peter Wagner and Kath¬ 
leen Lyons review current palaeobiological meth¬ 
ods for inferring extinction patterns. These extend 
from 'traditional' methods that use information 
in stratigraphic ranges to approaches that exploit 
exact information about distributions of finds 
within stratigraphic ranges. As Wagner and Lyons 
show, the fossil record provides data that can be 
used to test a wide variety of extinction hypotheses. 

Michael Rosenzweig has been influential in shap¬ 
ing views about species area and species time 
curves (Rosenzweig 1995). *4n Chapter 20 he is 
joined by John Donoghue, Yue Max Li, and Chi 


Yuan in an examination of species density, partic¬ 
ularly its use in assessing and comparing sites in 
a management or conservation context. The attrac¬ 
tion of species density in such evaluations is clear. 
Although, as we have emphasized above, there is 
no single measure of biodiversity that serves all 
needs, many practitioners desire a few informative 
and intuitive metrics, ideally ones that appeal to 
politicians and policy makers. Species density is an 
obvious candidate but has a significant drawback in 
that it does not scale linearly with area. Rosenzweig 
and colleagues develop estimates of species density 
that take the curvature of species-area curves into 
account and highlight its value as a useful indicator 
of environmental condition. 

Our conclusion, in Chapter 21, says that the mea¬ 
surement of biodiversity is growing (and will con¬ 
tinue to grow) increasingly more sophisticated. This 
is to be welcomed as it allows for more nuanced 
discussions and more refined measures appropriate 
to specific questions. At the same time, the healthy 
growth of a tree requires regular pruning. Measure¬ 
ment of biodiversity is no different. Ecologists need 
to put as much energy into removing unsuccessful 
measures as we do into developing new measures. 
Finally, the increasingly complex ability of scientists 
to measure the multifaceted idea of biodiversity 
should not be allowed to distract from the societal 
and policy goals of conserving biodiversity. There 
is a real concept and relatively simple measures 
get us close enough to allow policy decisions and 
assessments to occur. 

This book covers a wide range of approaches and 
reflects the activity and excitement at the frontiers 
of biodiversity measurement and assessment. We 
are conscious that there are aspects of biodiversity 
measurement that we have not dealt with or have 
mentioned only briefly. These include issues such 
as population trends, the extent of habitats, the 
status of protected areas, the fate of threatened or 
invasive species, surrogate measures (for instance 
those based on remote sensing), and matters related 
to ecosystem services such as wild harvests. A 
single book could not do justice to all this mate¬ 
rial, and interested readers will find that there is 
a growing literature that tackles it. However, we 
stress that the fundamental issues that underpin 
many of these alternative and broader measures 
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are addressed in the book. For example, a compos¬ 
ite indicator or headline index is only as good as 
the estimates of species richness or species density 
subsumed within it, and sampling and detectablity 
issues need to be addressed in every study before 
robust conclusions can be drawn. As explained at 
the outset our emphasis in this book is on Darwin's 
'proportional numbers and kinds' and we hope that 
it will prove a useful guide to measuring and assess¬ 
ing this 'beautiful diversity'. 
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Basic Measurement Issues 








CHAPTER 2 


An overview of sampling issues 
in species diversity and 
abundance surveys 

Scott A. Bonar, Jeffrey S. Fehmi, and Norman Mercado-Silva 


2.1 Introduction 

Homer S. Swingle was a pioneer in pond man¬ 
agement and professor of fisheries at Auburn Uni¬ 
versity. Results from his scientific research on the 
principles of fish interactions in lakes and ponds, 
and methods to improve fish production are used 
worldwide, and he is credited with improving lives 
of countless people by increasing the amount of 
food available to those in developing countries and 
providing recreational opportunities to a multitude 
of anglers. His advice was sought by US presidents, 
prime ministers, the United Nations and countries 
across the globe (Byrd 1973). 

His biological work was groundbreaking, yet 
there is little evidence that Homer Swingle used 
sampling procedures when conducting his work. 
His major works contain no reports of sampling 
error with his estimates (Swingle 1950,1952) and his 
contemporaries have pointed out that he actively 
avoided subsampling in favor of census. His meth¬ 
ods for enumerating a population were simple. At 
the conclusion of his pond experiments, he would 
census every fish involved, either by draining or 
treating the pond with piscicide (fish poison). Being 
a student on Swingle's piscicide crew meant count¬ 
ing every dead fish as it floated to the water's sur¬ 
face in the hot Alabama sun over a 3-day period. 
Those involved in the third-day pickup were the 
unluckiest, and often the least senior members of 
the crew—a smelly task indeed! 

While a complete census of all experimental or 
survey subjects provides the most complete infor¬ 
mation, few of us have the luxury of counting every 


subject in a population to estimate population char¬ 
acteristics. It simply costs too much, takes too much 
time, or is infeasible in some other way. Therefore 
we have to sample a portion of the population to 
estimate diversity or population parameters such 
as abundance. For example, a marine biologist esti¬ 
mating the number of clams in the Gulf of Califor¬ 
nia cannot hope to count every one—he will expand 
an estimate calculated from a sample of plots. A 
plant biologist measuring the biomass of grasses 
in a field will not weigh every grass blade in the 
field—she will estimate the biomass through a sam¬ 
ple of plots containing grass. A wildlife biologist 
will not count all mice in a national park to deter¬ 
mine relative density of different species. Density 
will be determined by a subset of captures from 
different areas. 

To ensure the most useful information, a biolo¬ 
gist must be familiar with basic sampling issues. 
Here we discuss some issues that all biologists 
should know when sampling diversity, abundance, 
or other parameters. Consideration of these issues 
allows the biologist to sample in a manner that, 
while not a perfect reflection of the population, will 
provide the best representation of the true popula¬ 
tion as possible. 

2.2 State of the field 

Successful sampling of plant and animal commu¬ 
nities address similar basic issues. In the following 
sections we discuss considerations that every sam¬ 
pler should understand before beginning a survey. 


11 
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2.2.1 Setting objectives 

Critical to any good survey are clear objectives. 
These objectives should answer the fundamental 
questions for the survey. What is needed from the 
survey? How will the survey data be used? If the 
sampling design or methodology becomes confus¬ 
ing and complicated, returning to the objectives 
of the survey can help clarify what needs to be 
done. Deviating from the objectives of the sur¬ 
vey may make the information obtained from sam¬ 
pling much less valuable for answering the original 
questions. 

Explicit objectives can improve the study design. 
A poorly worded objective for a survey might be: 
'Estimate the effect of outflow from the Brandon 
Chemical Plant on freshwater clams.' More precise 
would be: 'Estimate the effect of outflow from the 
Brandon Chemical Plant on abundance, growth, 
and species diversity of freshwater clams.' Word¬ 
ing objectives as specifically as possible will allow 
biologists to design a survey that will meet those 
objectives. 

Objectives that drive plant and animal sampling 
efforts include estimating conservation status, the 
potential effect of human action or inaction on com¬ 
munities, or, perhaps the most common, comparing 
a trend in a site over time to adapt management. 

There are several other considerations when sam¬ 
pling diversity or abundance which help to define 
study objectives more clearly. 

2.2.2 An important partner: the statistician 

Just as a business executive needs the services of 
a good lawyer and a good accountant, a biolo¬ 
gist needs a good statistician. Go to any academic 
library and scores of books on sampling techniques 
are available. Which methods should be used to 
design a survey? Any biologist should have some 
background of sampling techniques, which may 
suffice for simple surveys. However, consultation 
with a good statistician can dramatically improve 
the sampling design. Finding a good statistician is 
like finding a good mechanic. The best ones are 
incredibly busy, and the biologist will probably 
need persistence to encortfage their involvement 
in a study. However, they can make the difference 


between a failed study and a study that can provide 
the most information for the lowest price. Which are 
the best statisticians for the biologist? They tend to 
be those who have on-the-ground, practical experi¬ 
ence with biological studies in addition to a solid 
statistical background. One talented statistician at 
a Pacific Northwestern University in the USA had a 
solid knowledge of his subject, and had also been a 
commercial salmon fisherman and an active partici¬ 
pant in field projects. The biologist could be assured 
that this individual would help to design a study 
that met the statistical requirements, but was also 
practical and logistically possible. 

2.2.3 What species to sample 

When sampling plant or animal diversity or abun¬ 
dance, a decision must be made as to which species 
to sample. At a recent meeting of the Ecological 
Society of America, Pulitzer Prize winning ecologist 
E.O. Wilson stated that a person could conceivably 
spend their entire life studying the species found 
associated with one tree stump. There are literally 
thousands of species, from bacteria to mammals, 
that make their home there—a staggering level of 
diversity. Practically, there are many species in any 
given area, so which do biologists sample? 

Study objectives can help to identify the species 
that should be studied and the methods required 
to study them. Typically there is more interest in 
some species than others. For example, in plants, 
sampling may be structured around plants asso¬ 
ciated with the abundance of food and habitat of 
an important wildlife species, invasive plants that 
impact desired plants, or rare plant species of spe¬ 
cial conservation concern. Plants with neutral or 
little impact on the survey objectives tend to be dis¬ 
counted. For animals, species of special conserva¬ 
tion concern can also be overall drivers of the study, 
such as those on threatened and endangered species 
lists. Other species might be indicators of ecolog¬ 
ical conditions such as the presence of Plecoptera 
(stonefly) or Chironomidae (chironomid) larvae in 
streams of varying water-quality conditions. Still 
others might be important species that drive a com¬ 
munity, such as the most abundant picivorous (fish¬ 
eating) and insectivorous (insect-eating) fish of a 
lake community that make up most of the biomass. 
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Not only species must be considered when sam¬ 
pling, but life stages and size as well. For plants, 
larger perennial plants are much more likely to be 
detected than smaller ephemeral plants. (Plant scale 
differences can be dealt with by changing the plot 
size as discussed later) (see also Chapter 3). For ani¬ 
mals, one must cover all the areas where the dif¬ 
ferent life stages of a given animal (in the case of 
abundance estimations) or all species (in the case 
of the estimation of diversity) might be. This may 
require the use of different sampling methodolo¬ 
gies for each stage or require that sampling efforts 
be carried out in different seasons or at different 
times of the day. For example, to detect the pres¬ 
ence of Salvelinus confluentus (bull trout) in Pacific 
Northwest streams, juveniles are most often sam¬ 
pled near the substrate at night because this is 
when they are most commonly seen (Bonar et al. 
1997). The sampling design and methods for ani¬ 
mals with large habitats (or ranges) will necessarily 
be different from those of sessile animals or ani¬ 
mals with very low motility. The size of the species 
of interest can affect the sampling. It will take a 
much larger plot to sample a grove of Sequoiaden- 
dron giganteum (sequoias) than Bromus tectorum 
(cheatgrass). 


2.2.4 Where to sample 

Where should a biologist sample if he or she wants 
to know something about an underlying commu¬ 
nity or population? At first glance, this sounds like 
a simple question, but it can be complex. Below 
are tips to help the biologist select from where the 
sample should be taken. 

The target population is the population or eco¬ 
logical resource of interest. For example, the target 
population might be all sea urchins in the waters 
off San Juan Island, Washington, mule deer on the 
Kaibab Plateau, or plant communities of the north¬ 
east Siberian coastal tundra. The sampling frame is 
the physical representation of the target population. 
It represents what or where the samples are taken 
from to characterize the target population. For the 
above examples the sampling frame might be the 
littoral zones of the San Juan Island from the shore 
to 20 m deep at high tide, the Kaibab Plateau, or the 


montane and lowland tundra distributed between 
the Laptev and Chukote Seas, respectively. 

For best results, the sampling frame should mir¬ 
ror or represent the target population as closely as 
possible. For example, say the biologist's objective 
was to estimate the proportion that each aquatic 
macrophyte (vascular plant) species contributed to 
the total macrophyte biomass in Lake Taupo, New 
Zealand. Because most macrophytes will be found 
in depths of less than 15 m, a sampling frame 
consisting of the littoral zone up to a depth of 
15 m should adequately bracket the community. 
However, if the sampling frame consisted only of 
sampling sites less thar\l m deep, it would not ade¬ 
quately encompass the target population of macro¬ 
phytes. Sampling in waters less than 1 m deep 
would potentially overestimate the proportion of 
shallow water emergent plant species biomass in 
the entire community and would not meet the 
research objective. 

Sometimes the sampling frame is obvious. The 
area bounds are established along jurisdictional or 
management boundaries as specified by the project 
funding (e.g. Saguaro National Park, Posey County, 
Indiana, the island of Barbados, or the site of the 
new subdivision). In some situations defining the 
sampling frame can be challenging. Perhaps an esti¬ 
mate of Cancer magister (Dungeness crab) density is 
needed for the Pacific Ocean off Washington State, 
USA. Three boundaries of the sample frame are 
easy to determine: the frame is bounded on the 
south at the Columbia River at the Oregon border, 
at the north at the middle of the Strait of Juan de 
Fuca on the Canadian boundary, and at the east 
by the waterline of the Washington coast. However, 
defining the western limit of the sampling frame, 
how far offshore to sample, is less clear. Cancer 
magister are not found in the deepest abyss, but 
become less dense in progressively deeper water. It 
would make no more sense to sample C. magister 
in a deep ocean trench than selecting sites contain¬ 
ing no people (such as a wilderness area) when 
surveying human population characteristics. Using 
the biological characteristics of the C. magister (it is 
rarely found in water deeper than 180 m) one can 
set an outer depth limit to the sampling frame. 

The sampling frame may be restricted to where 
sampling gear can be used effectively, even though 
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the organism may be found outside the frame. For 
example, a common method to sample freshwa¬ 
ter fish in lakes is electrofishing, but electrofishing 
cannot generally be used to capture fish in water 
deeper than about 3 m. The sampling frame for 
electrofishing sites in lakes would include segments 
along shorelines up to 3 m deep, not transects in 
deep offshore areas where there would be little 
chance of catching fish by this method. Often the 
samples taken from such a frame may not reflect 
the entire community diversity or abundance, but 
can provide an important index for trends or similar 
work. 

Because the sampling frame can vary among 
studies, the frame must be defined and reported in 
any report or publication. This is especially impor¬ 
tant if other biologists are not familiar with the type 
of survey and cannot assume its sampling frame. 

2.2.5 Bias, sampling error, and precision 

A biologist known by one of the authors wasted 
2 years of his working life and 4 years of a study, 
costing his project sponsors thousands of dollars, 
because he did not know the fundamentals of sam¬ 
pling procedure. The biologist was tasked with 
studying how littoral (shallow water) fish popula¬ 
tions changed following reduction in aquatic plant 
communities. He was to record the abundance of 
various fish species in the littoral zone for 2 years 
before the plants were removed and 2 years follow¬ 
ing. The 2 years of samples prior to plant removal 
had already been collected by others using spring¬ 
time electrofishing. The biologist had other tasks in 
the spring, so he decided to collect his electrofishing 
samples in the autumn. Unfortunately, unknown to 
the biologist, this was a fatal flaw for his study. Dif¬ 
ferent species and sizes of fish use the littoral zones 
of lakes in the spring and in the autumn. The sam¬ 
pling conducted pre treatment in the spring could 
not be compared to the sampling conducted by the 
biologist post treatment in the autumn because the 
effects of plant control could not be separated from 
the effects of seasonal differences in fish use of the 
littoral zone. The biologist failed to consider bias. 
Bias, and also the concepts of precision, sampling 
error, and accuracy shouleF be known by all who 
collect samples. 


The terms sampling bias, precision, sampling 
error and accuracy are all related to the funda¬ 
mental question in survey design: 'How well does 
the sample estimate parameters of the target pop¬ 
ulation?' Accuracy is the closeness of agreement 
between an observed value and an accepted refer¬ 
ence value (Locke 1994) and is a function of how 
much bias and sampling error is in the sample. Bias 
and sampling error are often confused, but they are 
different. 

A sample is biased when, to some degree, it 
does not represent the population from which 
it was taken. Bias can be subdivided into mea¬ 
surement bias and sampling bias. Measurement 
bias occurs when measurements are taken incor¬ 
rectly. Perhaps the biologist was sampling lizards 
and did not set traps correctly, so the abundance 
of lizards was underestimated. Perhaps a poorly 
trained crew recorded birds seen in only a por¬ 
tion of the area of interest, biasing the study 
towards those birds found only in that portion. 
Perhaps a fish-measuring board started at 5 mm 
instead of Omm, so all fish lengths were over¬ 
estimated. These are examples of measurement 
bias, which can be reduced by careful crew train¬ 
ing and ensuring sampling equipment is prop¬ 
erly calibrated and working correctly. Sampling 
bias occurs when the sample does not include all 
groups of interest in the population. Stated in 
another way, this means that a sampling method 
does not capture all organisms equally. Every sam¬ 
pling method has inherent sampling biases that 
can affect estimations of animal diversity and 
abundance (Willis & Murphy 1996; Krebs 1999; 
Southwood & Henderson 2000) 

Sampling or method-specific biases need to be 
understood to adequately estimate the accuracy of 
animal diversity and abundance values. Sampling 
can be biased by organism size. A net used in esti¬ 
mations of fish diversity or abundance, for exam¬ 
ple, can have a mesh size that very small fishes 
can escape through, therefore the net is said to be 
biased towards larger fish, that is it captures larger 
fish in a disproportionately greater amount than 
they occur in the population as a whole, so the 
mean fish length of the sample will be greater than 
that of the population. Other methods or a smaller 
mesh size may be effective for capturing smaller 
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fish, but they will have their own limitations (Lyons 
1986; Mercado-Silva & Escandon-Sandoval 2008; 
Rabeni et al. 2009). Sampling can also be biased 
towards particular species. A trawl might cap¬ 
ture smelt effectively because of their schooling 
behaviour in the water column, but Micropterus 
salmoides (largemouth bass), which are associated 
with the lake bottom and structure, may be harder 
to capture, therefore the gear is biased for smelt. 
Depending on the species, the differing amounts 
of bias present can be difficult or impossible to 
measure, and present particular problems when 
evaluating diversity using species proportions (e.g. 
Shannon or Simpson's index). Those species which 
are easier to capture can be overrepresented in 
the indices (see further discussion of this point 
in Chapter 3). 

Size and species bias are also present in plant 
sampling, as each sampling method is typically bet¬ 
ter at detecting some groups of plants than others. 
In most instances, larger and more persistent plants 
are much more likely to be detected than smaller, 
more ephemeral, or juvenile plants. Juvenile plants 
of species judged important can be especially prob¬ 
lematic in that they can be difficult to detect com¬ 
pared to adults because all plants are small early in 
their lifecycle. Plant scale differences can be dealt 
with by changing the plot size, as will be discussed 
later. 

Bias can make comparisons with the true popula¬ 
tion difficult, but there are some ways to account for 
bias in a study. Ensuring crews are properly trained 
and biologists are familiar with the amount of bias 
that may occur using a particular sampling gear 
aids in the evaluation of the importance of bias in 
surveys. 

If bias is the same over time (trend surveys) or 
space (for status surveys) the sample data can be 
used for comparison or to follow trends. A trawl 
net could be used to monitor fish population trends 
in a lake over time, under similar conditions, with 
the understanding that fewer small fish would be 
captured than larger fish in each sample, but abun¬ 
dance of all might increase or decrease over time. 
However, if a biologist uses a gill net one month to 
catch fish and the next month uses electrofishing for 
trend monitoring, the bias would be different for 
the two gear types and the samples could not be 


compared. The biologist would not know if trends 
in the data were due to the actual changes in the 
population or just differing gear bias. Similarly, if 
trawling was conducted in very different habitats, 
or in different seasons of the year when fish behav¬ 
iour varied, the biologist would not know if dif¬ 
ferences in capture rates were due to how well the 
trawl fished in the different habitats or time periods, 
or in the actual numbers of fish present. This is 
the error the biologist made in the example above. 
The bias of the electrofishing surveys was not the 
same in the spring as in the autumn in the lake in 
question. 

If the objective is to capture a diverse sample, 
a combination of methods should be used to can¬ 
cel out the bias j of the different sampling gears as 
much as possible. This should be tried to collect 
a wide range of (1) species with different habi¬ 
tat use, (2) species with different behaviour, and 
(3) different life stages of a given species with 
a specific size and ethology. The sampling meth¬ 
ods selected should cover all possible niches that 
exist in an ecosystem. Lethal methods such as the 
application of poison or explosives in fish studies, 
although they are destructive, may in some cases be 
the only alternatives to fully sample a community. 
Still, even these may fail at capturing certain ele¬ 
ments of the community (e.g. fish eggs resistant to 
poison effects, individual fish that die but cannot be 
captured). 

Perhaps the best methods to correct for sampling 
bias involve double sampling while surveying or 
conducting preliminary studies that calibrate gear, 
that is evaluating sampling gear efficiency for the 
studied species under a wide variety of conditions 
(Peterson & Paukert 2009; Chapter 3). In double 
sampling a randomly selected subset of samples 
from the overall survey are sampled twice: once 
using the sampling method and then using an unbi¬ 
ased population estimator, such capture-recapture 
for species richness or abundance (Williams et al. 
2002) or an occupancy estimator for species detec¬ 
tion (MacKenzie et al. 2002). The 'true' population 
estimates are then regressed against the samples 
and correction factors are developed for samples in 
the survey that were not double-sampled (Box 2.1). 
Gear efficiency studies are more labour inten¬ 
sive than double sampling as they use abundance 
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Box 2.1 An example of correcting for bias and estimating precision 


Micropterus dolomieu (smallmouth bass) are collected 
using backpack electrofishing at 50 block-netted sites to 
estimate their mean abundance in a stream. To correct for 
sampling bias, a randomly selected subset of 15 sites is 
selected for double sampling. In each of the double-sample 


sites, M. dolomieu catch per unit effort (CPUE) and an 
unbiased population estimator (capture-recapture 
estimate) of abundance are determined. Next the 
abundance estimate is regressed against CPUE to calculate 
a correction for all samples. 



Electrofishing CPUE (fish/100 m) 


All 50 samples are then converted from CPUE to 
estimated abundance using the regression equation 
developed above. How precise is the mean abundance of 
M. dolomieu in this stream? The mean abundance and 
associated standard deviation of the 50 converted samples 
can be calculated using formulas from any standard 


statistics text or computer software. Notice that there is still 
variability that was not captured during the conversion of 
CPUE to estimated abundance. More variability could have 
been accounted for by developing a more labour-intensive 
gear calibration model, accounting for the effects of habitat 
complexity and other factors. 


estimates obtained through capture-recapture or 
removal methods (which can have bias associated 
with them as well) to calculate the efficiency of the 
gear in a separate study before the survey. Gear effi¬ 
ciency depends on species-specific attributes such 
as capture avoidance, size, patterns of aggregation, 
and habitat complexity. Gear efficiency studies help 
to determine the true rarity of a given species, 
as some sampling methodologies, being relatively 
inefficient at capturing a certain species, may give 
the false idea that a species is less abundant than it 
actually is (Lyons 1986; Bayley & Herendeen 2000; 
Longino et al. 2002; Chapter 3). If gear efficiency is 
not calculated, sometimes a lower threshold of gear 


efficiency can be used, which is based on pilot sur¬ 
veys or the literature. Bonar et al. (1997) assumed 
a lower threshold of 25% snorkelling efficiency for 
encountering the Salvelinus confluentus (bull trout) 
that were present in Pacific Northwest streams. This 
was used in the calculation of sample sizes needed 
to detect S. confluentus if they occurred in actual 
densities of less than 0.15 fish/100 m of stream. 
Even if no correction is used for bias, it is impor¬ 
tant to understand the limits of sampling methods 
being used and create correction factors between 
sampling methods. These could be used to calculate 
method bias that should be accounted for in data 
interpretation. 
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Sampling error is the inverse of the amount 
of precision in an estimate. Unlike sampling bias, 
sampling error is easily measured, quantified, and 
reported as variability, standard deviation, standard 
error or some other measure of dispersion. For 
example, an estimate is needed of fish density in 
a lake. A trawl net that samples a specific volume 
of water is used to catch the fish. Fish are spread 
evenly throughout the lake, and on six sequen¬ 
tial trawls 5, 6, 4, 5, 6, and 3 fish are captured, 
respectively. The average or mean number of fish 
per trawl sample is 4.8. Here the biologist may 
have a lot of confidence that the true mean of the 
fish population available to trawling (excluding any 
bias) is close to 4.8. Sampling error is low (0.48) 
and the sample is precise. If the fish are distrib¬ 
uted in clumped groups throughout the lake, the 
six sequential trawls may capture 2,11,0,14,0, and 
2 individuals, respectively. There is more 'spread' 
or experimental error in the later estimate (with an 
error of 2.48) and the average (4.8 fish/trawl) cal¬ 
culated from this estimate is less precise. The biolo¬ 
gist is less confident that this average represents the 
actual population mean, again assuming there is no 
bias. 

One way to illustrate precision and bias is by 
reference to targets (Fig. 2.1). The bull's eye of the 
target is the true population parameter. For the first 
target on the left, a tight cluster of samples to the 
right of the bull's eye is precise (the samples are 
tightly clustered) and biased (the samples are not 
centred on the target but are off to the side). A loose 
cluster of samples to the right (second target) is 
imprecise and biased. A loose cluster of samples in 
the middle is imprecise and unbiased (third target). 
A tight cluster of samples in the centre of the target 
is precise and unbiased (fourth target). 

Now we know our objectives, what species we 
will sample, where we will sample, and some of 
the factors that can affect sampling surveys, such as 
bias and precision, we will discuss how to sample 
and how much to sample. 

2.2.6 How to sample 

Nets, plots, traps, bird-calling surveys, dredge sam¬ 
ples: what can be used to quantify the animal 
or plant population? Below we discuss important 


considerations about selecting gear types or plots 
when sampling. Often previous studies or refer¬ 
ence books of sampling techniques (e.g. Bonar et 
al. 2009b) can be used to choose an appropriate 
method for sampling the species of interest. 

The minimum size unit in sampling is called 
a sampling unit. For animal surveys, the sampling 
unit could be a location for a pit-fall trap set, a 
transect for sighting birds, or a shoreline length 
where all frogs are captured. For plants, the sam¬ 
pling unit is often a plot. A subset of sampling 
units is drawn from the sampling frame in a. way 
to ensure the subset represents the area of interest. 
Units to be sampled within a sampling frame are 
usually chosen randomly, which means that every 
sampling unit has an equal chance of being selected. 
They can also be assigned systematically, which 
means using a random starting point and taking 
a sample at every nth unit. Some type of random 
allocation of sampling units is almost always pre¬ 
ferred, but systematic sampling can be equivalent 
to random sampling if the ordering of the indi¬ 
viduals is independent of the attribute being mea¬ 
sured. Descriptions of various commonly used sam¬ 
pling designs, such as simple random sampling, 
stratified random sampling, systematic sampling, 
adaptive sampling, and cluster sampling, are pro¬ 
vided in sampling texts (Cochran 1977; Williams 
et al. 2002; Thompson 2004; Scheaffer et al. 2006; 
Bonar et al. 2009b). 

Researchers must be careful to adopt sampling 
methodologies that will minimize the effects on 
individuals, populations, and habitat. The humorist 
Don Novello wrote a comic letter to NASA, which 
was testing for life on Mars in the mid-1970s by 
burning a small sample of Martian soil and test¬ 
ing for carbon residue. Novello wrote 'That doesn't 
mean there is life on Mars—that means there was 
life on Mars—You killed it!' In the past, it was com¬ 
mon to sample animal populations using destruc¬ 
tive techniques. For example, toxicants were widely 
used to sample fish communities in a variety of 
ecosystems. Today these methodologies are often 
discouraged, although they are still in use for a vari¬ 
ety of management purposes (Bettoli & Maceina 
1996). Techniques that destroy habitats (e.g. bottom 
trawling for fishes or marcoinvertebrates [Freese 
et al. 1999]), change animal behaviour, or hurt 
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Figure 2.1 Depiction of bias and precision, with centre of target representing true population value. Far left figure shows sample that is biased and 
precise, sample on second target is biased and imprecise; sample on third target is unbiased and imprecise; sample of far right target is unbiased and 
precise. 


animals (i.e. gill nets [Murphy & Willis 1996]) 
should also be carefully considered before using. 
When possible, observation and other sampling 
techniques that do not involve the taking of individ¬ 
uals may be good options to estimate the diversity 
and abundance of certain species, especially those 
easy to identify. However, sometimes it is necessary 
to take a small number of individuals to provide 
data to protect the population as a whole. 

Pilot surveys are small studies carried out 
before the initial survey, and are an indispens¬ 
able component of most surveys. They can be 
used to help identify the species present, the 
scale at which organisms occur, and the hetero¬ 
geneity of populations for sample variance esti¬ 
mates in sample size calculations. For example, in 
S. confluentus surveys, Bonar et al. (1997) recom¬ 
mend that an informal snorkelling survey is used 
to document presence before extensive time and 
resources are spent designing an expensive statisti¬ 
cally intense study for the reach. Bonar et al. (1993) 
used pilot surveys to calculate the sample size 
needed to adequately estimate biomass of aquatic 
macrophytes. 

For animals, motility is perhaps the most impor¬ 
tant factor that determines which sampling method 
to use. Sessile or low-mobility animals are usu¬ 
ally detectable by a variety of active sampling 
methodologies and cannot easily evade detection. 
Although it is clearly possible to capture mobile 
animals using active methodologies, animals with 
moderate to high mobility can, and often will, 
attempt to evade being captured by active meth¬ 
ods. Passive methods (e.g. attractants, traps), obser¬ 
vation (e.g. direct observation, distance sampling). 


or techniques based on animal signs (e.g. food 
remains, tracks, scats) are often better suited to 
mobile animals. See Bonar et al. (2009b), Murphy 
and Willis (1996), and Krausman (2002) for various 
techniques to sample fish and wildlife populations. 
Other references such as Hauer and Resh (2006), 
and Sorensen et al. (2002) discuss sampling inver¬ 
tebrate populations. 

For plants, plots are usually the unit sampled, 
and the biologists must choose a plot shape and 
plot size. Square, rectangular, and circular plots, 
and transects (a one-dimensional rectangle (Pueyo 
et al. 2006)) have all been used to sample diver¬ 
sity. Squares and rectangles can be the easiest to 
delineate in the field because strings or tapes can 
be stretched between the corner points. Narrow 
rectangular plots are often recommended because 
they can capture more patchiness within each plot, 
although this will depend on the elongation of the 
plot and the homogeneity of the site. Circular plots 
for surveying plants can be difficult to set up in 
areas with trees and shrubs because plots are usu¬ 
ally marked by attaching a string to the plot cen¬ 
tre and moving it around in a circle. The move¬ 
ment of the string radius is obstructed by plants 
above the surveyor's height. Larger plots capture 
more different landscape elements and plant asso¬ 
ciations, but lose fine detail. This can be corrected 
by measuring small and large plants at different 
scales using nested plot designs (Fig. 2.2), including 
the Whittaker plot (Shimida 1984) or the modified 
Whittaker plot (Stohlgren 2007). More methods on 
determining the appropriate size of the plot will be 
discussed in the section on how many samples to 
collect below. See Stohlgren (2007), Bonham (1989), 
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Figure 2.2 Two common nested plot designs, (a) A simple geometric progression of plot sizes typical of a pilot study, (b) A Whittaker plot (Shimida 
1984), which is commonly used as well as numerous published modifications (Stohlgren 2007). 


and Elzinga et al. (1998) for various techniques used 
to sample plants. 

Detecting rare or elusive species sometimes 
requires specialized sampling designs (Thompson 
2004). For plants that are rare because they are 
concentrated in a small part of the area of interest 
(a clump or cluster), use two-stage sampling (e.g. 
Elzinga et al. 1998). In two-stage sampling, the plot 
is searched by walking, driving, or remote sensing 
until the clump of rare plants is discovered, and 
then the clump is subsampled. Adaptive sampling 
for plants and animals is similar (e.g. Smith et al. 
2004). Here biologists search for a species of interest 
at predetermined locations and if the species is 
found, they sample nearby. To search for rare plants,"* 
Poon and Margules (2004) recommend stratifying 
a region of interest using environmental variables 
and then noting in which strata populations of rare 
species are known. New searches are then concen¬ 
trated in similar environments in the same general 
location or in other geographical areas. 

2.2.7 Quantifying the sample 

When a sample is taken, what will the biolo¬ 
gist measure to quantify abundance and diver¬ 
sity? A measure of abundance is important because 
diversity is not the same as species richness and 
depends on the relative commonness and rarity of 
the species present. There are many options, for 
example the number of organisms, their weight, 
the amount of area they occupy, or their presence. 
Which of these factors are measured depends on the 
objectives of the study. 


Abundance measures tend to be more problem¬ 
atic with plants than with animals because plant 
growth can exhibit considerable plasticity under 
different environmental conditions and plants com¬ 
monly have both asexual and sexual reproduction 
strategies. With animals, abundance measures are 
made difficult because of movement, different life 
stages and sizes, and ontogenic changes. Common 
choices for measuring abundance include density, 
biomass, and cover (usually for plants). These data 
allow calculation of the many diversity indices 
currently available. Commonly used ones include 
Simpson and Shannon (Stohlgren 2007). 

Density is the number of individuals by species 
per unit area. The number of individuals is sim¬ 
ply counted within each plot. For animals, direct 
counts are often not possible and a method such 
as catch per unit effort, mark-recapture, removal 
technique or distance sampling is used to estimate 
the animal density in an area (Murphy & Willis 
1996; Buckland et al. 2001; Krausman 2002; Bonar 
et al. 2009b). Sometimes counts of animals in an area 
are so great that subsampling for density (e.g. zoo¬ 
plankton (Karjalainen et al. 1996)) or using biomass 
measures are easier (e.g. zooplankton, marine fish 
(Ware & Thomson 2005)). Some plant types do not 
fit well into counting density (sod-forming grasses, 
multi-stemmed shrubs, clonal trees, etc.) because 
determining which part represents an individual is 
a daunting and time-prohibitive exercise. Without 
digging up roots to look for a connection between 
individuals, it can be impossible to assess observa- 
tionally which plants are individuals. This problem 
expands as the total area covered by these plants 
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increases within the area of interest. While each sit¬ 
uation is somewhat unique, a rule stating that indi¬ 
viduals with known clonal propagation must be 
greater than a fixed distance apart to be considered 
individuals (e.g. for bunch grasses a separation of 
more than 10 cm) will allow data collection despite 
the uncertainty. 

An enticing but less useful alternative is gather 
frequency data (presence or absence) for each plot 
and then attempt to convert it to density. While 
this has had some success in the field (see Bonham 
(1989) for plants), it remains overly dependent on 
the assumption that the underlying distribution of 
organisms is random. However, for plants, as the 
plot size reaches or exceeds the size of a plant of 
interest, the density data collected are essentially 
frequency data, which can add complexity to the 
analysis. Determining if the plant is in the plot or 
not can sometimes be difficult because of the plant's 
plastic growth form. A typical rule is that if an indi¬ 
vidual plant's basal area is more than half in the plot 
and rooted in the plot it counts as being in that plot. 

Measuring areal coverage is usually conducted 
for plants and sessile animals such as sponges 
(e.g. Lauer & Spacie 2004) and coral (e.g. Gardner 
et al. 2003). Again, because of the growth plasticity 
of plants, areal coverage measurements can some¬ 
times be problematic. Using cover mitigates some 
of the difficulties with collecting density as well as 
being more or less independent of the scale of the 
plant in terms of the time needed to collect data 
for a plot—individuals do not need to be enumer¬ 
ated. Cover has been successfully used in many 
studies of diversity but it can make comparison 
between studies difficult because small differences 
in procedure can produce large differences in the 
data (see the section below). In addition to proce¬ 
dural differences, many vegetation protocols only 
advise measuring the surface layer of the cover, 
which discounts or dismisses those species that 
occur beneath the canopy. This gives dispropor¬ 
tionate weight to plants dominant at the time of 
observation. In some systems, the aerially dominant 
plant is relatively fixed throughout the season, as 
mentioned above, yet in other systems it varies 
from week to week throughout the growing season, 
which would change the*diversity measure based 
simply on the time of sampling rather than from 
more substantive differences. A simple fix for this 


difficulty with measuring cover is to observe the 
cover of all species independently of one another, 
but this takes more time and cannot be done with 
remote sensing or photography because it requires 
evaluation of each layer of cover. Remote sens¬ 
ing data can give information valuable to diver¬ 
sity studies in that the vegetation alliance can often 
be identified, but species other than some large 
plants and invaders with unique phenology cannot 
be usually identified (Gillespie et al. 2008). Remote 
sensing can also help with plot stratification and 
offer support for inferences about the extent and 
grain of the field sampling. 

Directly measuring the biomass of animals is 
usually a straightforward procedure and can vary 
from simple to time-consuming depending on the 
study. Individuals are weighed (either wet or dry 
weight depending on the study) and summed for 
the sampling unit; a subsample from the unit is 
weighed and a mean obtained that is multiplied 
by the number of individuals in the unit, or the 
group is weighed in mass. For plants the biomass 
option can be very time-consuming because rather 
than observationally collecting these data, plants 
are typically harvested, sorted, bagged, oven dried, 
and weighed. 

Conversion between the different measures of 
abundance between animals can often be con¬ 
ducted through regression or other types of analy¬ 
ses when animals of the same species have sim¬ 
ilar morphology. The plastic growth of plants 
(where individuals can mature in a range of sizes 
spanning orders of magnitude) makes conversion 
between different measures of abundance problem¬ 
atic. However, there are limited situations where 
the individual plants are similar enough to make 
cover density and biomass well correlated. Only 
evaluation of the actual conditions at the time of 
data collection can help to determine if a conversion 
between measures of abundance is well advised. 

2.2.8 When to sample 

Determination of diversity and abundance in 
animal communities is heavily influenced by 
seasonality, time of day in which a sample is taken, 
and the reproductive stage of a given taxon. Sam¬ 
pling efforts should occur during a season which 
allows for most taxa to be susceptible to being 
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captured. This includes considerations of migratory 
behaviour. Similarly, species' diurnal/nocturnal 
movement cycles should be taken into considera¬ 
tion when planning sampling procedures. Often, 
entire assemblages are most detectible at a given 
time of day. It is at these times when samples should 
be taken to maximize species richness. 

Timing is also a very important consideration for 
plant studies. Each growing season, the range of 
plants, their density, their distribution, and even 
their size can be different. Species can colonize or be 
extirpated from the area of interest in short periods 
of time. Even within a growing season, the pres¬ 
ence of perennial plants waxes and wanes, while 
this pattern defines the lifecycle of many annu¬ 
als and ruderal perennials. Within the herbaceous 
zone, switches in dominance are not uncommon 
as the season progresses, despite relatively stable 
plant populations. Some plants mature early and 
others later. A similar change can occur in woody 
communities, where dominance changes can seem 
sudden between seasons as later succession species 
in the community overtop those that arrived earlier. 
In deserts, plants can remain dormant in the seed 
bank for years until suitable growing conditions 
occur. An overarching statement about the appro¬ 
priate time period in which to observe diversity is 
not possible but instead depends on the within- and 
between-year site variabilities and how they relate 
to the larger question. 

2.2.9 How many samples to collect 

The size of sampling error depends on (1) the 
heterogenity among sampling units in the sam¬ 
ple frame and (2) the number of sampling units 
observed/collected (sample size) (Groves 1989); 
therefore, the more heterogeneity among samples, 
the more samples must be taken to estimate the 
parameter with a given degree of confidence. In 
addition, the more samples taken the better the 
chance of collecting rare species. Because of these 
principles, collecting more small samples is usually 
better than collecting fewer large samples. 

The most precise and accurate estimations of ani¬ 
mal diversity and abundance come from intense 
sampling on a system for a long time period. 
These studies reveal temporal variation of a species' 
appearance in collections, and have experienced 


specialists who can identify the collected species 
and know which species from adjacent areas might 
appear in their collections (Longino et al. 2002). 
More commonly, diversity or abundance studies are 
of limited duration or geographical scope, and need 
to employ efficient sampling methodologies that 
can provide accurate estimates with small sample 
sizes. 

Sufficient sample sizes are obtained in different 
ways, depending on study objectives. Sometimes 
sample sizes are previously set based on previ¬ 
ous studies. For example, in plant studies, no.mat¬ 
ter how large the plot size, at least five indepen¬ 
dent sampling units are recommended to allow 
enough plots to capture the variability (Stohlgren 
(2007) recommends seven). In other studies sam¬ 
ple sizes are calculated based on the accepted level 
of error in the estimate and the variability of the 
parameter estimated through a pilot study, previ¬ 
ous study, or, sometimes, an educated guess. The 
cost of sampling can be a factor in decisions. If 
developing 99% confidence intervals for an esti¬ 
mate requires 100000 samples, a biologist might 
have_to accept 95, 90 or even 80% confidence inter¬ 
vals for the estimate if it only requires 30-100 
samples and does not substantially affect survey 
objectives. Surveying references provide overviews 
of how to assign samples and calculate sample 
sizes for means, totals, and proportions using com¬ 
mon sampling designs, including simple random, 
stratified random, cluster, and systematic sampling 
(Bonham 1989; Elzinga et al. 1998; Magurran 2004; 
Bonar et al. 2009b). 

Advances in computer science and mathematics 
have resulted in substantial progress in the devel¬ 
opment of monitoring designs for trends that allow 
the biologist to maximize power by allocating sam¬ 
ples over space and time based on the variance 
structure of initial samples (e.g. King et al. (1981), 
Gibbs et al. 1998, Urquhart et al. (1998), Urquhart 
& Kincaid (1999), and Larsen et al., (2001, 2004)). 
Software programs such as MONITOR (Gibbs 1995) 
and TRENDS (Gerrodette 1993) are available to help 
the biologist allocate samples and maximize power 
in order to detect trends. 

Species accumulation curves to estimate the 
amount of sampling needed for diversity sampling 
have recently seen increased use. Here, plot size 
or the number of samples taken is recorded on 
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Figure 2.3 A example of a species area curve from nested square plots sampling perennial plants in the Sonoran desert, Arizona in 2007. If the plot area 
increases from 1 to 10 m 2 (point A), the plot captures twice as many species. Once the plot size increases to 20 m 2 (point B), the plot has doubled in area 
but only captured one additional species. As the area of the plot continues to increase (point C), the number of new species increases only very gradually 
as rare species are encountered. An efficient plot area in this example might be 20 m 2 . 


the tt-axis and the number of species collected 
is recorded on the y-axis (Fig. 2.3). Increasing the 
number of samples or plot size initially has a dra¬ 
matic effect on the number of species encountered 
(A). However, as more samples are taken or plot 


size is increased, the curve flattens where more sam¬ 
pling does not result in many more species collected 
(B). It is at the point where the curve levels that 
the number of samples to be collected or the plot 
size is optimized (C). Species accumulation curves 


Figure 2.4 A theoretical species area curve for a nested 
plot as it increases in area. The segment marked A is the 
traditional species area curve with the line flattening as all 
common species have been captured and occasional rare 
species are encountered. As the line enters segment B, the 
line shows the plot expanding out of the current vegetation 
association into a different type. For example, in the Sonoran 
Desert, Arizona, this could be moving from an upland site 
into riparian vegetation or onto a slope. If a large enough 
plot could be censused, this stair-stepped pattern would be 
repeated as the area Increases to encompass all of the 
vegetation associations in the area of interest. In practice, 
the part of the site represented in segment A would be 
stratified and sampled separately frorffthat in segment B. 
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for nested plots occur at different scales and can be 
more complex (Fig. 2.4). 

Given the difficulty of detecting rare species in 
assemblages, and the limitations that exist in car¬ 
rying out field surveys to estimate biodiversity, it 
is often necessary to use statistical techniques that 
can help in determining the amount of effort that 
would be necessary to capture the entire diversity. 
Three methods for estimating species diversity are 
(1) fitting a statistical distribution to rank abun¬ 
dance data, (2) extrapolating a species accumu¬ 
lation curve to its asymptote, and (3) estimat¬ 
ing the asymptotic number of species with non- 
parametric estimators (Longino et al. 2002). For 
individual-based (abundance) data, the area under 
a fitted, lognormal abundance distribution has 
been used to estimate the total number of species, 
including undetected rare species (Chao et al. 
2009). Other species abundance models such as 
the log-series, geometric, negative binomial, Zipf- 
Mandelbrot, and the broken-stick (Magurran 2004) 
can also be fit to abundance data to estimate asymp¬ 
totic species richness. Curve-fitting methods, which 
can be applied to both abundance data and inci¬ 
dence data, extrapolate a fitted function such as the 
Michaelis-Menten equation or a mixture model out 
to the asymptote of the species accumulation graph 
(Soberon & Llorente 1993; Colwell et al. 2004). Other 
parametric and non-parametric methods are useful 
in the estimation of biodiversity in a species assem¬ 
blage (Chao et al. 2006; Chao et al. 2009). 

2.2,10 Comparing information from different 
surveys 

The objective of studies on the diversity or abun¬ 
dance of animals is often to compare these metrics 
through time or from one area to another. Conver¬ 
sion techniques have been developed for compar¬ 
ing data collected using different techniques (e.g. 
Scheiner et al. 2000; Peterson & Paukert 2009), but 
these are often labour intensive and can introduce 
additional error. Methods used at different times or 
in different areas can also be standardized (Bonar 
& Hubert 2002; Bonar et al. 2009a,b). By using the 
same methodology in each sampling effort, it is 
possible to eliminate the variability introduced by 
modification of sampling methods, although vali¬ 


dation still needs to occur to relate the standard¬ 
ized sample to the true population. Standardization 
not only refers to the equipment used or how it is 
used, but also to other aspects of sampling such as 
timing of sampling, the habitats that are sampled, 
and effort. Care should be exercised to make data 
from independent collections as comparable to one 
another as possible. Standardization is useful even 
in cases when large-scale time or space comparisons 
are not the focus of a given study. Today, many data¬ 
bases exist that compile information from a variety 
of independent studies that were carried out using 
standard techniques. These often serve as a basis 
for large-scale studies. Standard methods for ani¬ 
mals such as fish (see Bonar et al. 2009b) have been 
developed, but standard methods of sampling plant 
diversity have been difficult to develop (Stohlgren 
2007) and general methods must be modified for 
each unique set of questions, abiotic-, biotic-, and 
budgetary-realities. 

** 

2.2.11 Preparing for the field 

While objectives are being set, and the survey 
design developed, there are additional tasks to 
consider. Often a permit from a natural resource 
management agency will be required to carry out 
sample. Always plan enough time to obtain the 
permit. For example, obtaining federal or state per¬ 
mits to sample endangered species in some areas 
of the USA can take up to a year, and because 
diversity or abundance surveys are often used to 
quantify a species in peril, long wait times can 
be the norm. After permits have been obtained, 
the appropriate landowners and natural resource 
agency personnel should be notified to inform them 
of specific dates when sampling will take place. If 
notification does not occur, law enforcement per¬ 
sonnel are often called out to the site to investigate 
the 'suspicious activity' occurring. Field sampling is 
frequently conducted in remote locations using spe¬ 
cialized equipment. Project logistics and the salaries 
of surveyors are usually the most expensive parts 
of any project. Checklists should be used to ensure 
that all field equipment is ready and loaded before 
the survey, and contingency plans (such as hav¬ 
ing a spare set of equipment available) for when 
equipment failure occurs should be developed to 
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avoid unnecessary expense. When surveying a field 
site, it is important to move quickly and accurately, 
minimizing unnecessary breaks and ignoring minor 
discomfort to complete the site on schedule. Above 
all, the emphasis should be on safety. Surveys 
should not be continued if dangerous weather con¬ 
ditions develop, sampling equipment poses a risk, 
or other conditions associated with the sampling 
are compromised. 

2.3 Prospectus 

Proper planning of surveys and consideration of 
fundamental issues such as bias, accuracy, and 
precision can help ensure a useful, valid study 
(Box 2.2). Biologists who initiate surveys without 
considering the basics of survey design risk embar¬ 
rassment and the outlay of considerable funds with 
few results. Those who carefully plan their surveys 
from the start will provide information that can 
advance science, influence politics, or shape laws 
and policies. 


Box 2.2 Guidelines for sampling 

• Develop clear, detailed objectives for the survey. 

• Define the sampling frame—select what, where, and 
how to sample. 

• Seek advice from statisticians when designing surveys. 

• Consider how bias and sampling error will affect 
estimates. Correct for bias (calibrate sampling) if 
possible and maximize numbers of samples collected 
and allocate samples in time and space to reduce 
sampling error. 

• Incorporate some form of randomization when 
selecting samples. 

• Chose methods to sample that minimize impact to the 
organisms studied. 

• When possible, sample using standard techniques so 
results among studies can be compared. 

• Incorporate time and planning for obtaining sampling 
permits and conducting safe surveys, and develop 
contingency plans for when things go wrong in the 
field. 

• When in doubt about survey design or logistics, refer 
back to survey objectives. 


Comparison across large regions and commu¬ 
nication among diverse researchers are becom¬ 
ing increasingly important, therefore the adoption 
and development of standard sampling techniques 
will play even a greater role in the future, sim¬ 
ilar to the standard methods already developed 
for climate science, water chemistry, geology, and 
medicine. 

With the advent of technology, analysis pro¬ 
cedures, and automation, many sampling proce¬ 
dures will become easier over time. However, 
the basic considerations will remain the same. A 
fundamental knowledge of basic sampling issues 
will help those surveying for biological diver¬ 
sity, no matter what tools are available in the 
future. 

2.4 Key points 

1. Set clear objectives for surveys, with a carefully 
defined sampling frame. 

2. Design the survey, seeking help from a statis¬ 
tician when needed. Define when, where, and 
how to survey, using methods that minimize 
mortality to surveyed organisms and incorpo¬ 
rate some form of randomness. 

3. Precision and bias affect the utility of sampling 
estimates. Account for the effects of bias by using 
the same techniques over time to monitor trends, 
using a variety of gears with different bias to 
cancel species-related bias for point estimates, 
or, the best way, by validating sampling tech¬ 
niques with true population parameters. Maxi¬ 
mize precision by increasing sample sizes and 
careful consideration of sample allocation over 
space and time. 

4. Pilot surveys can help define the sampling effort 
required and identify the logistical challenges 
the main survey will face. 

5. Sample using established standard sampling 
methods when possible to maximize the compa¬ 
rability of the data among studies. Ensure safe 
surveys and use field etiquette, including obtain¬ 
ing sampling permits in advance, to maximize 
survey efficiency. 
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Biodiversity monitoring: 
the relevance of detectability 

Stephen T. Buckland, Angelika C. Studeny, Anne E. Magurran, 
and Stuart E. Newson 


3.1 Introduction 

Most studies that monitor biodiversity are deficient 
in an important respect: they assume that the cap¬ 
tured or observed animals or plants form the com¬ 
munity of interest, or at least are a representative 
sample from it. For single-site monitoring, if the 
site is small and individuals of the taxa of inter¬ 
est are easily detected, it might not be unreason¬ 
able to conclude that the counts correspond to the 
community of interest. However, for more cryp¬ 
tic species and/or larger sites, complete enumera¬ 
tion of individuals is seldom feasible (Coddington 
et al. 2009). Most diversity measures assume that 
the individuals we count are a random sample of 
the community (Magurran 2004, pi 36). It might be 
possible to ensure this if sampling is conducted 
on a sufficiently large random sample of plots 
within the site, where each plot is small enough 
to allow complete enumeration of individuals on 
the plot. If this is not achievable, we can expect 
heterogeneity among individuals in probability of 
detection. 

Increasingly, in part due to the change of empha¬ 
sis for biodiversity monitoring signalled by the 2010 
Biodiversity Target of the Convention on Biological 
Diversity (which states that there should be 'a sig¬ 
nificant reduction of the current rate of biodiversity 
loss' by 2010), the focus is on quantifying biodiver¬ 
sity trends within regions or countries, in which 
case the community should be the entire biota of 
the region or country. Since we cannot record the 
entire biota directly, we must sample locations and 
species (Chapter 2), with the aim of drawing infer¬ 
ence on the entire biota. If survey sites or plots are 


selected according to a randomized scheme, such as 
the stratified random scheme of the UK Breeding 
Bird Survey (Newson et al. 2005), the issue of repre¬ 
sentativeness of sites is addressed. Representative¬ 
ness of species is more problematic, especially when 
complete species lists do not exist, and some species 
and taxa arejnuch more difficult to survey than 
others. Hence, biodiversity monitoring will always 
be based on a reduced set of taxa. Within that set, a 
random subset of species might be an appropriate 
means of easing the task of monitoring. 

Even after addressing these issues, we need to 
consider the issue of detectability, unless our plots 
are so small that all individuals can be detected 
and so numerous that all, or nearly all, species 
occur on at least one sample plot. We use the 
term 'probability of detection' for the probabil¬ 
ity that an individual is recorded. Recording of 
individuals might be achieved by visual or aural 
detection, or by capturing them in traps, nets, etc. 
A combination of methods might be used. Typi¬ 
cally, we can expect detectability to vary accord¬ 
ing to a number of factors, but the one that inter¬ 
ests us here is when detectability of individuals 
varies among species, as this will bias the species 
proportions used in many biodiversity measures 
(Yoccoz et al. 2001). 

In this chapter, we consider the implications 
when detectability varies by species, with partic¬ 
ular reference to monitoring regional biodiversity 
trends. We consider some measures of biodiversity 
that are potentially useful for quantifying trends in 
regional biodiversity and explore how these mea¬ 
sures are affected when the issue of detectability is 
ignored. 
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Although our focus is on regional monitoring, 
there are parallels with site monitoring. For exam¬ 
ple, we might seek to quantify the biodiversity of a 
site by sampling a number of small plots within the 
site. In this case, we do not require that each plot 
is sufficiently large to reflect the entire biodiversity 
of the site, but we expect the plots collectively to 
achieve this. If it is possible to make each plot suffi¬ 
ciently small to allow complete enumeration of the 
plot, then the issue of detectability of individuals 
on the sample plots becomes irrelevant. If, however, 
there are too few plots, many species present in 
the site will not be detected because they do not 
occur on any of the plots. Thus we need a large 
sample of plots, but we do not need large volumes 
of data from each plot, to quantify reliably the bio¬ 
diversity of the site. The same issues apply when 
we seek to quantify the biodiversity of a region by 
sampling many sites within a region. Note that, for 
a regional survey, we might adopt a hierarchical 
sampling scheme, with a representative set of sites 
through the region and a representative set of plots 
within each site. 

We focus primarily on detectability of individu¬ 
als in this chapter. However, as alluded to above, 
there is also an issue of detectability of species: if 
detectability of individuals of a species is very low, 
or if a species is so rare that few, if any, occur in 
the sampled sites or plots, then the species may 
be undetected (Chapter 4). In statistical terms, if 
there is an adequate sample of plots or sites, this 
is not a problem, as unrecorded species will repre¬ 
sent a small proportion of individuals in the com¬ 
munity; the estimated proportion will simply be 
zero if no individuals are detected. What consti¬ 
tutes an 'adequate sample' of plots or sites depends 
on the spatial heterogeneity of the environment 
and on heterogeneity in the detection probabili¬ 
ties (greater heterogeneity requires a larger sam¬ 
ple size in both cases). It also depends on the size 
of sample plots/sites, especially if the site/region 
is spatially homogeneous with respect to its bio¬ 
diversity. (In a very heterogeneous environment, 
we would prefer a large sample of small plots 
to a small sample of larger plots, as we need to 
sample more locations to quantify biodiversity of 
the site or region. In a More homogeneous envi¬ 
ronment, there would be less gain from sampling 


more locations and more gain from making plots 
larger.) 

3.2 State of the field: which 
biodiversity measure? 

In choosing our biodiversity measures, several con¬ 
siderations must be taken into account. First, we 
need to choose the component(s) of biodiversity 
we wish to measure. Buckland et al. (2005) list six 
criteria that we might like our measure to satisfy 
(Table 3.1). The non-statistical criteria are related to 
three components of biodiversity: species richness, 
species evenness, and abundance. The statistical cri¬ 
teria are that the measure should have good and 
measurable precision, and have an expectation that 
does not vary with sample size. 

Another consideration is whether trends in bio¬ 
diversity measures should be sensitive to trends in 
the rarest species of a community. Rare species are 
an important positive contributor to species rich¬ 
ness, although they make it difficult to estimate 
(Colwell & Coddington 1994; Chapter 4). However, 
an increase in the number of rare species reduces 
species evenness and rare species generally have a 
negligible contribution to abundance. 

If rare species are excluded from analyses, the 
geometric mean of relative abundance indices, as 
used by Gregory et al. (2003), has good properties. 

Table 3.1 The six criteria proposed by Buckland et al. (2005) for a biodi¬ 
versity measure. 

Criterion Description 

1 For a system that has a constant number of species, overall 

abundance, and species evenness, but with varying 
abundance of individual species, the index should show no 
trend. 

2 If overall abundance is decreasing, but number of species and 

species evenness are constant, the index should decrease. 

3 If species evenness is decreasing, but number of species and 

overall abundance are constant, the index should decrease. 

4 If number of species is decreasing, but overall abundance and 

species evenness are constant, the index should decrease. 

5 The index should have an estimator whose expected value is 

not a function of sample size. 

6 The estimator of the index should have good and measurable 

precision. 
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meeting all the criteria of Buckland et al. (2005) 
except one: if the number of species is declin¬ 
ing, the index fails to reflect this, as the rare 
species are excluded. If they are included, zero 
counts or abundances are problematic. They can 
be replaced by small positive values to allow 
the index to be calculated, but its variance is 
likely to be high when rare species are analysed 
(Buckland et al. 2005). 

Note that 'rare' species are often defined prag¬ 
matically to be those species for which few if 
any individuals are detected by the sampling 
programme, so that estimation may be compro¬ 
mised, as for the above method using the geo¬ 
metric mean. Ecologically, it seems unsatisfactory 
for 'rare' to be determined by sampling effort, 
rather than by, for example, the proportion of indi¬ 
viduals in the community that belong to a given 
species. 

Whether rare species are included or not, Simp¬ 
son's index performs well with respect to five of 
the six criteria of Buckland et al. (2005). The one 
criterion for which it fails relates to abundance: if 
all species are declining at the same rate, the index 
remains constant. Strictly, its expectation is a func¬ 
tion of sample size. If the form of the index used 
is 1 - D, where D = YlPf ar) d p, is the estimated 

i 

proportion of the community that is of species i, 
it can be adjusted to ensure that it is an unbiased 
measure of 1 — pf, where p, is the true propor- 

i 

tion of the community that is of species i. The 
adjustment is to multiply the index by n/(n — 1), 
where n is the total number of individuals detected 
(Lande et al. 2003). However, this assumes that the 
rt individuals detected are a simple random sam¬ 
ple of the N individuals (across all species) in the 
community for which we wish to draw inference. 
Even if we were interested only in local biodiver¬ 
sity, it is usually implausible that detectabilities or 
trappabilities will be the same across species. In 
the case of quantifying regional biodiversity, the 
correction has even less relevance. As n is typi¬ 
cally large, the issue is academic: the multiplier 
ls so close to unity that it makes no practical 
difference. 

The Shannon index also generally performs well 
with respect to five of the six criteria of Buckland 


et al. (2005). Like Simpson's index, it stays constant 
if all species are declining at the same rate. The 
Shannon index is also inferior to Simpson's index 
with respect to the dependence of its expectation 
on sample size. Its bias is —(s — 1 )/n, where s is 
the total number of species in the community (and 
is generally unknown); hence, for large n, the bias 
can generally be considered negligible. However, if 
rare species are included in monitoring, and these 
account for a significant proportion of total abun¬ 
dance, then the Shannon index violates the criterion 
that its expectation should not be a function of sam¬ 
ple size (Lande et al. 2003). 

Buckland et al. (2005) proposed a modified form 
of the Shannon index that satisfies all six of their 
criteria, provided n is appreciably larger than s. 
However, it has been constructed simply to sat¬ 
isfy the criteria, without any theoretical founda¬ 
tion, and its performance has not been rigorously 
assessed. In practice, it is generally preferable to 
use different indices to reflect the different ele¬ 
ments of biodiversity, assumed here to be species 
richness, species evenness, and abundance. Only 
if a single headline index of biodiversity change 
is needed might the modified Shannon index be 
considered. 

A related issue relevant to monitoring regional 
biodiversity is that of a and |3 diversity (Chap¬ 
ter 6). Over large regions, much of the diversity 
will be due to (3 diversity; that is it arises because 
species composition and relative abundances dif¬ 
fer appreciably among locations within the region. 
Site-specific diversity measures will reflect oc diver¬ 
sity only. By analysing data pooled across sampling 
locations (in which either species proportions p, are 
averaged across sites or counts or estimated abun¬ 
dances are summed across sites), y diversity is esti¬ 
mated. oc diversity is estimated by averaging site- 
specific measures across sites, possibly weighting 
by number of individuals detected or trapped at 
each site, and 3 diversity is estimated as either the 
difference between y and oc diversity (additive 3 
diversity) or the ratio of y to oc diversity (multi¬ 
plicative 3 diversity). If either the Shannon index 
or Simpson's index is used, estimated 3 diversity is 
guaranteed to be positive using the additive defin¬ 
ition and to be greater than unity using the multi¬ 
plicative definition. 
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3.3 Detectability: are species counts 
relevant for monitoring biodiversity? 

3.3.1 Individual detectability 

All too often, numbers of individuals detected are 
substituted into equations for measuring diversity 
with little thought given to whether those numbers 
give biased estimates of the quantities of interest. 
Much of the theory underlying biodiversity mea¬ 
sures assumes that all individuals in the commu¬ 
nity are equally likely to be sampled, irrespective of 
species. In real communities, this will seldom be the 
case. Some species will be inherently more easily 
detected or trapped than others. If detectability is 
not estimated, then biodiversity measures will be 
biased (see Box 3.1). The theoretical underpinnings 
claimed for some measures of biodiversity provide 
little reassurance in reality. 

Let us explore the implications if issues of 
detectability are ignored. We consider the case 
where we are interested in biodiversity of a region, 
but the same arguments apply if we are simply 
interested in a single plot or site. Suppose for a 
given survey region we have: 

s = number of species in the region 
Nj = number of individuals of species i 
in the region, i = 1,,s 
rii = number of individuals of species i 
detected in the region 
Trj = probability of detection of an indi¬ 
vidual of species i with estimate £•, 

£(«,) = N,tt, = expected number of indi¬ 
viduals of species i detected 
N, = rij / 7r,- = estimated number of indi¬ 
viduals of species i in the region 
Pi = H / N, , with corresponding 

estimate p, = H/T.U i $ 

(ji = E(rij)/Y!i=\ E(«/), with correspond¬ 
ing estimate <?, = n,j £, =1 % 

When the true Nj are not known, diversity 
indices such as Simpson's index (D = £1., p} or 
transformations of this) and the Shannon index 
(H = - £? =1 p, log e p, ) should be estimated by sub¬ 
stituting estimates pi of p, , which requires estimates 
Ttj. Instead, standard prdftice is to evaluate them by 


substituting q, for p,. If we can assume that »; = n 
for all i, which means that the probability of detec¬ 
tion of an individual does not depend on species, 
then this strategy is justified. The assumption is 
seldom stated explicitly and almost never criticised, 
yet it seems unlikely that any community comprises 
individuals that are equally detectable irrespective 
of species, unless it is possible to enumerate all indi¬ 
viduals on plots. If detection is by trapping, indi¬ 
viduals of some species are more readily trapped 
than others; if it is by active searching, individuals 
of some species are more visible or audible than 
others. This issue is recognized, for example, in 
arthropod surveys, where different trapping meth¬ 
ods are often used in an attempt to achieve a more 
representative sample of individuals (e.g. Colwell & 
Coddington 1994). 

If we relax the assumption that all organisms are 
equally detectable, we can still use the measures of 
the previous section, but with counts replaced by 
abundance (or density) estimates. For a diversity, 
the H would be estimated for each plot, while for 
y diversity they would be estimated for the wider 
survey region, obtained from a combined analysis 
of the data from all sites in the region (with appro¬ 
priate allowance for survey design, e.g. if the sites 
are a stratified random sample, analysis should be 
appropriately stratified). 

3.3.2 Estimating individual detectability 

The problem remains of how to estimate detectabil¬ 
ity and hence abundance. The preferred approach 
will depend on the taxa being sampled and the 
environment they occupy. The simplest solution 
is to have sample plots that are sufficiently small 
that all individuals on the plot are detected. In this 
case the counts are the plot abundance estimates. 
Equally, abundance values for the wider region 
from which the plots were sampled are readily esti¬ 
mated. In the case of simple random sampling, the 
pi are then the same, whether based on counts or 
abundance estimates. This approach might be fea¬ 
sible with ground beetles, for example, or for large 
mammals in open habitat. However, the method is 
inefficient for most taxa because plots may have to 
be so small that most counts are zero. A further 
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Box 3.1 If we estimate species proportions by the proportions observed in our sample, 
are measures based on these proportions biased? 


Intuitively, it might seem that we would not introduce bias 
into biodiversity measures even when the 777 vary among 
species, provided that 7 r,- is independent of A/,-, that is that 
the detectability of individuals of a species is independent 
of species abundance. Thus, although < 7 , may be smaller 
than Pi for a less detectable species and larger for a more 
detectable species, we might hope that these biases 
compensate, so that the distributions of g, and p,- are the 
same. Unfortunately this is not the case. We can see this by 
considering the distributions across species i of £(/?,-) and 
A/,. We would like the distribution of A/,- across species to be 
identical to that of kE(ni) for some constant k (which 
cancels top and bottom when q, is evaluated, giving 
< 7 , = p^. Thus we would like the distributions of log e {N,) 
and log e [E(rii)] to be the same apart from location (which 
will differ by the factor log t k). However, log e [E (n,)] = 
log e {N,) + log e (7Ti), so that var{/og f [£(/?;)]} = 
m[log e (N,)] + var[/og,(7r,)] -f 2cov[/og e (A/,), 
log e (iT,)]. When v, and A/,- are independent, the 
covariance term is zero, and var {log e [f (n,-)]} > 
var[/og f (A/;)] except when 77,- = rr for all / (in which 
case var[/og e (7T/)] = 0). Thus if 777 and /V,- are 

problem for mobile populations is that an instanta¬ 
neous count of the plot is needed; if this cannot be 
achieved, then new individuals may move onto the 
plot while counting is being conducted, leading to 
overestimation of density. This overestimation will 
be greater for more mobile species, generating bias 
in the diversity measures. 

Another solution is to use mark-recapture to 
estimate detectabilities. The approach tends to be 
costly and is prone to bias because even within a 
species, capture probability can be heterogeneous 
among individuals. This is in part because ani¬ 
mals can move on and off the plot, and those with 
home ranges that are wholly in the plot are much 
more likely to be trapped than those whose home 
ranges are largely off the plot. This generates bias 
in detectability and abundance estimates. 

Distance sampling (Buckland 2001 ) can be an 
effective way of estimating detectability from 
sightings surveys (see Box 3.2). For biodiversity 
monitoring, a problem of distance sampling is 


independent, the distributions are identical apart from 
location only when detectability does not vary by species. 

The two distributions have the same variance 
when var[/og e (77y)] = — 2cov[/og e (A/,), tog s (7r,)], giving 
cov[log e (Ni), log e {TTj)] = -0.5 var[/og e (77y)]. If the 777 
are very variable, this corresponds to a strong negative 
correlation between detectability and abundance. 

These results indicate that we can generally expect the- < 7 , 
to be more variable across species than the p,- when 
detectabilities 77 ,- vary across species. Species evenness 
takes its maximum value when all the p,- are the same, 
corresponding to zero variance. As variability increases, 
evenness decreases. Thus, if we estimate p, by < 7 ,-, we 
expect to underestimate evenness and hence 
biodiversity. 

Conclusion 

Measures of biodiversity that are based on species 
proportions are biased when these are estimated from 
samples, unless individuals of different species are all 
equally detectable. 

that there are typically too few detections for 
many species to allow estimation of detectability. 
However, by pooling data across species and then 
using multiple-covariate distance sampling with 
species as a factor (Alldredge et al. 2007; Marques 
et al. 2007) this difficulty is largely avoided, 
although the detectability of the rarest species 
with, say, fewer than 10 detections might have to 
be assumed to be the same as for similar, more 
common, species (Section 3.4). 

Box 3.3 discusses different methods of 
detectability and distinguishes between individual 
detectability and species detectability. 

3.3.3 Species detectability 

For rare species, q, and p, will be zero if 
tt, = 0. This too is a detectability issue, but 
relates to the detectability of a species, rather 
than to the detectability of individuals. Although 
many methods exist to estimate the number of 
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Box 3.2 Distance sampling 


Distance sampling (Buckland 2001, 2004) covers 
several related methods in which distances of detected 
individuals from a line or point are recorded, from which 
the effective area corresponding to a count is estimated. 
The most commonly used method is line transect sampling, 
for which lines are placed randomly in the survey region or, 
more commonly, a set of equally spaced parallel lines is 
randomly superimposed on the survey region. An 
observer traverses each line, recording any animals 
detected within a distance w of the line, together with 
their distance from the line. These distances are used to 
estimate a detection function, which is the probability that 
an animal is detected, as a function of distance from the 
line. For the basic method, it is assumed that this 
probability is 1 at zero distance from the line, that is 
animals on the line are seen with certainty. Given an 


estimate of the detection function, we can estimate the 
proportion of animals detected within a strip extending a 
distance w from the line on either side. This allows us to 
estimate animal density by adjusting encounter rates (i.e. 
the number of animals detected per unit length of line) to 
allow for animals missed in this strip. Given random 
placement of an adequate number of lines (or a grid of 
lines) through the survey region, this density estimate is 
representative of the whole survey region, allowing 
abundance within that region to be estimated. Probability 
of detection tt, of individuals of species / comprises two 
components, one reflecting the probability that an animal 
within a surveyed strip of half-width w is detected and the 
other reflecting the probability that an animal is within a 
surveyed strip. 



Distance x from line 



The left-hand plot shows part of a line transect with 
positions of detected animals shown. The distances x are 
shown in the form of a histogram with three distance 
intervals in the middle plot, together with a half-normal 
detection function fitted to the distance data. For an animal 
that is within a survey strip, its probability of detection is 


represented in the right-hand plot by the proportion of the 
rectangle that is shaded, corresponding to the fitted 
half-normal function. 

This method is used to sample a wide variety of 
terrestrial, freshwater, and marine animals, as well as some 
plant populations (e.g. trees in rain forest). 


species that are not detected (see Chapter 4), the 
resulting estimates tend to be particularly sensitive 
to the assumptions made. Species detectabilities 
will be even more heterogeneous than individual 
detectabilities (because individuals of rare species 
are much less likely to be encountered than indi¬ 
viduals of common species), so estimators should 
accommodate this hetSfogeneity. Even so, it is a 
challenging problem to estimate the number of 
undetected species purely based on data from those 


species that are detected. Most measures of biodi¬ 
versity are not unduly affected if some rare species 
are estimated to have zero abundance. Species rich¬ 
ness is an exception to this. 

For regional monitoring there are three reasons 
why a species might not be detected: it might be 
present on at least one of the sample plots but not 
detected, it might be present in the region but not 
on any of the sample plots, and it might be absent 
from the region at the time of the survey. 
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Box 3.3 Which methods of estimating 
detectability might be appropriate 
for which taxa? 


Individual detectability 

Distance sampling (Buckland 2001, 2004) is useful for 
species where individuals are readily detectable from a 
line or point, at least if they are close to the observer. It 
can be effective for estimating the detectability of 
medium and large terrestrial mammals, marine 
mammals, most birds, many reptiles, many fish and 
benthic fauna, and plants where individual plants are 
readily identifiable. 

Mark-recapture (Borchers et al. 2002; Williams et al. 
2002 ) can be effective when trapping methods or other 
methods that involve capture are used. Groups for which 
probability of capture might be quantified using 
mark-recapture include small mammals, birds in rain 
forests, reptiles and amphibians, and insects. 

In rare circumstances when animals are trapped, a 
sufficiently large proportion of the population might be 
removed to allow removal and related catch methods 
(Borchers et al. 2002) to be used to estimate the 
trapping probability of individuals. 

Species detectability 

The species abundance distribution summarizes the 
counts of each species observed in a survey, recording 
the number of species represented by one, two, 
three,... individuals. To estimate the species richness of 
a community we need to estimate the number of species 
represented by no individuals in the sample. There are 
many possible methods for this (Magurran 2004; 

Chapter 4). The methods are applicable to any taxon, 
although estimation of the number of missing species 
tends to lack robustness when there is no replicate 
sampling and individuals of different species have 
different detectabilities. 

When there are repeat visits to each of a number of 
sites, occupancy methods (Mackenzie et al. 2006; 

Chapter 4) allow estimation of species richness from 
simple species lists compiled on each visit to each site. 
Again, the methods are applicable to any taxon. 

Rare species, which can account for a substan- 
hal fraction of the species richness in a com¬ 
munity (Coddington et al. 2009), may be missed 
during sampling simply by chance. Undersampling 


is particularly problematical in tropical arthropod 
assemblages. Species richness estimators offer one 
solution (Chapter 4) as long as there is no inherent 
bias in the sampling methodology. However, rare 
and common species may differ in ecology (for 
example by occupying different habitats (Magur¬ 
ran & Henderson 2003)). Moreover, most sampling 
techniques violate the assumption that individu¬ 
als are captured or recorded at random. A light 
trap is a classic example of this since not all moth 
species respond to the light source in the same 
way (Southwood & Henderson 2000). It is not 
just species that vary in their responses—male and 
female moths may react differently to the light stim¬ 
ulus (Altermatt et al. 2009). Heterogeneity in this 
sex difference amongst species will contribute to 
bias. Weather conditions can also affect sampling. 
Holyoak et al. (1997) found that the total num¬ 
ber of moths captured by a light trap was posi¬ 
tively correlated with temperature and negatively 
correlated with rainfall, and that the strength of 
this correlation varied with taxon. Preston (1948) 
argued that a moth trap is an acceptable method 
of sampling because, while species vary in their 
degree of phototropism, it is random with regard 
to commonness and rarity amongst those taxa that 
are equally attracted to the light. However, compar¬ 
isons of diversity amongst sites, or over time peri¬ 
ods, in which the detectability of species varies are 
challenging. 

Species might be missed because they are 
not present at the time of the survey. Temporal 
turnover in the form of local colonizations and 
extinctions is an intrinsic feature of all ecological 
communities (MacArthur & Wilson 1967). Species 
are gained and lost by island assemblages while 
seed banks may contain many species that are only 
occasionally seen in the 'expressed community'. 
This raises the conceptual issue of who belongs 
to the community. Should species that have been 
recorded in the past, and which may re-occur 
at some point in the future, be included in a 
species list? In practice of course it is not always 
easy to determine which form of absence— 
methodological or ecological—is involved. 
These issues are discussed further in Magurran 
(Chapter 7). Box 3.4 looks at ways in which 
detectability might be addressed in surveys of 
butterflies. 
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Figure 3.1 Estimated trends in breeding bird biodiversity in Scottish farmland, 1994-2007, assessed using Simpson's index 1 —D, the Shannon index H 
and the geometric mean 0. The indices derived from counts n, uncorrected for detectability are shown on the left-hand side, while those derived from 
abundance estimates A/, are shown on the right-hand side. 

are zero, we excluded any species for which there Table 3.3 Estimated slope of relationship between each of the follow- 


was at least one year during 1994-2007 in which 
no birds were recorded. These species were red- 
legged partridge, common quail, corncrake, tur¬ 
tle dove, yellow wagtail, brambling, and corn 
bunting. 

The above indices were evaluated for each year 
1994 to 2007 (Fig. 3.1). Tests for linear trend were 
also conducted (Table 3.3). 

As anticipated, diversity as measured by Simp¬ 
son's or the Shannon index is estimated to be 
greater when we allow for different detectabilities 
between species than when we do not (Fig. 3.1). 
However, the evidence for trend in these indices 
is largely unaffected (Table 3.3). For the geometric 
mean based on counts, we obtain evidence of an 
increase in biodiversity over the time period of the 
BBS (p = 0.024), but if we allow for detectability, 
the test for linear trertS becomes non-significant 
(p = 0.126, Table3.3). 


ing indices and year: Simpson's index (1—0), the Shannon index (H), and 
the geometric mean of relative abundance estimates (6). For each index, 
we show estimate (slope), standard error (s.e.) and the p value of the test 
of no slope for two cases: estimates calculated from counts uncorrected 
for detectability (subscript c) and estimates calculated from abundance 
estimates (subscript a). 


Index 

Slope 

s.e. 

P 

Index 

Slope 

s.e. 

P 

1-D C 

0.0024 

0.0011 

0.058 

1 -D a 

0.0031 

0.0015 

0.059 

He 

0.0072 

0.0047 

0,153 

Ha 

0.0076 

0.0050 

0.153 

G c 

0.0359 

0.0139 

0.024 

Ga 

0.0223 

0.0136 

0.126 


3.5 Discussion 

Ignoring detectability might not be a major prob¬ 
lem if the bias is consistent across time or space. 
However, if effort varies between times or locations, 
then bias arising from variable detectability among 
species will itself be variable, generating bias in 
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temporal or spatial trend estimates in biodiversity. 
We found clear evidence of bias in estimates of the 
Shannon and Simpson's indices from the Scottish 
farmland BBS data, when counts were uncorrected 
for detectability. In addition, we found that evi¬ 
dence for a trend in relative abundance, as mea¬ 
sured by the geometric mean across species, van¬ 
ished when the relative abundance estimates were 
corrected for detectability. 

The goal of many biodiversity studies is to com¬ 
pare sites or communities that reflect natural varia¬ 
tion (such as a successional or altitudinal gradient) 
or that have been subjected to anthropogenic 
impacts such as pollution or disturbance (Chap¬ 
ter 17). The localities or samples in such a compar¬ 
ison may differ in a number of ways. For exam¬ 
ple, the body size of organisms often changes when 
a site is polluted or a system is over-harvested— 
indeed the shifting relationship between biomass 
and numerical abundance is used as a measure of 
impact (Warwick & Clarke 1994). Body size is likely 
to influence detectability. Many sampling devices 
such as gill nets or plankton nets trap individu¬ 
als within a particular size range so, although it 
may seem sensible to deploy the same sampling 
gear when assessing biodiversity at different sites, 
variation in detectability could generate mislead¬ 
ing answers. Behavioural changes in harvested or 
hunted species (Casas et al. 2009) may also affect 
detectability. 

If detectability cannot readily be estimated as an 
integral part of a monitoring scheme, it may be pos¬ 
sible to carry out experiments at sample locations to 
estimate detection probabilities, although in some 
communities, such as speciose tropical arthropod 
assemblages, this will not be feasible. 

In our example, although we have corrected 
for detectability, we anticipate that some bias will 
remain. Our analyses assume that birds that are on 
or very close to the transect line will be detected 
with certainty, and this is unlikely to be the case 
for some species, especially nocturnal species. In 
addition, it is not always possible to follow the 
ideal transect line, and the actual line covered may 
be biased towards or away from habitats preferred 
by certain species. Nevertheless, bias in estimated 
biodiversity trends can be expected to have been 
reduced. 


Detectability can be particularly difficult to 
accommodate when most species are represented 
by very few individuals, so that even if the sur¬ 
vey design allows detectability to be estimated, 
there may be insufficient data for reliable estima¬ 
tion. For single-site monitoring in tropical environ¬ 
ments, typically around a third of the species are 
represented by just a single individual (Coddington 
et al. 2009). If the focus is on estimating regional 
biodiversity trends and data are gathered from a 
number of representative sites through the region, 
the proportion of species for which there are too 
few data to estimate detectability is likely to be 
smaller. This is related to the issue of 'repetitive 
sampling': repeated sampling of the same units 
results in more (apparently) rare species than when 
sampling occurs at the same intensity but where 
each areajs sampled just once (Dobyns 1997; McGill 
2003a). However, for highly diverse communities, 
the number of species with very few detections 
will remain large. Another strategy to reduce the 
number of apparently rare species in the sample is 
to use different trapping methods at each sample 
location, as a species that is unlikely to be caught 
by one method may be more likely to be caught 
by another (Longino et al. 2002). Of course, if the 
different methods sample different species entirely 
then, while the community is better represented, 
the proportion of species with very few detections 
for the data pooled across the trapping methods 
will be within the range observed for the separate 
methods. 

3.6 Prospectus 

The rising interest in monitoring regional biodiver¬ 
sity, sparked in large measure by the 2010 target, 
is starting to focus attention on the communities 
that are to be monitored. In statistical terminology, 
it is important to define the population (which typ¬ 
ically comprises individuals of a number of species 
within a region in our case) about which we wish to 
draw inference. In this context, it is important either 
to ensure that individuals of different species are 
equally likely to be in our sample or to gather data 
that allow estimation of detectability by species. 
Thus we anticipate that the design and field pro¬ 
tocols of surveys that are to be used to monitor 
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biodiversity trends will receive closer scrutiny than 
has occurred to date. 

We have shown how to adjust diversity indices 
when estimates of detectability are available from 
sample data. With respect to statistical inference, 
we expect further developments related to estima¬ 
tors and their properties, for when detectability 
varies among species. A thorough analysis would, 
for example, estimate uncertainty in the diversity 
estimates. 

Especially when we are interested in monitoring 
regional biodiversity, our focus is more on a rate 
of change of diversity (and change in that rate of 
change) than in a single index value. While only a 
linear regression model was fitted to the adjusted 
diversity estimates here to look at trends over time, 
more sophisticated models such as the generalized 
additive models used by Buckland et al. (2005) pro¬ 
vide greater flexibility, and thus potentially a better 
fit to the time series generated by the underlying 
ecological processes. 

However, the issue of missing species will 
inevitably remain a focus of concern and of 
research, in particular when we are looking at a 
regional scale. Species might be missing because of 
seasonal effects at the time of the study or because 
they are rare and therefore missed in the sampling 
process. In the latter case, when a large number of 
plots is sampled repeatedly over time, occupancy 
methods are likely to prove useful for addressing 
this difficulty. It remains to be studied more closely 
how strongly the adjusted indices are affected by 
this uncertainty about the total number of species. 
Survey design as well as data analysis should try 
to take seasonal or other ecological effects into 
account as far as possible to allow identification of 
the different effects which lead to missed species in 
sample data. 

3.7 Key points 

1. When using diversity measures such as the 

Shannon and Simpson's indices, researchers are 


making the implicit assumption that individ¬ 
uals have been sampled at random from the 
community. 

2. In most cases this assumption does not hold as 
there will be heterogeneity amongst individuals 
in the probability of detection. 

3. If probability of detection of individuals can be 
calculated for each species, indices can be cor¬ 
rected for detectability. 

4. Unseen species are also a detectability issue— 
in this case the species are too rare to have 
been detected by the sampling protocols or 
the protocols are ineffective for detecting the 
species. 

5. The Scottish farmland bird community was 
used as a case study and detection functions 
fitted for each species. Estimates of biodiver¬ 
sity made using the Shannon and Simpson's 
indices were higher when the data were cor¬ 
rected for detectability, and the geometric mean 
of relative abundance estimates showed evi¬ 
dence of a positive trend for the uncorrected 
data, but no evidence of a trend for the corrected 
data. 

6 . A drawback of this approach is that detec¬ 
tion functions cannot be fitted to rare species, 
so that either such species must be excluded 
from analysis or similar species must be iden¬ 
tified that are assumed to have the same 
detectability. 

7. If not estimated, variable detectability may 
introduce bias in many comparative inves¬ 
tigations of biodiversity, such as contrasts 
between polluted and unpolluted assemblages 
and estimation of temporal or spatial trends in 
diversity. 
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CHAPTER 4 


Estimating species richness 

Nicholas J. Gotelli and Robert K. Colwell 


4.1 Introduction 

Measuring species richness is an essential objec¬ 
tive for many community ecologists and conserva¬ 
tion biologists. The number of species in a local 
assemblage is an intuitive and natural index of 
community structure, and patterns of species rich¬ 
ness have been measured at both small (e.g. Blake 
& Loiselle 2000) and large (e.g. Rahbek & Graves 
2001) spatial scales. Many classic models in commu¬ 
nity ecology, such as the MacArthur-Wilson equi¬ 
librium model (MacArthur & Wilson 1967) and 
the intermediate disturbance hypothesis (Connell 
1978), as well as more recent models of neutral 
theory (Hubbell 2001), metacommunity structure 
(Holyoak et al. 2005), and biogeography (Gotelli 
et al. 2009) generate quantitative predictions of the 
number of coexisting species. To make progress in 
modelling species richness, these predictions need 
to be compared with empirical data. In applied 
ecology and conservation biology, the number of 
species that remain in a community represents the 
ultimate 'scorecard' in the fight to preserve and 
restore perturbed communities (e.g. Brook et al. 
2003). 

Yet, in spite of our familiarity with species rich¬ 
ness, it is a surprisingly difficult variable to mea¬ 
sure. Almost without exception, species richness 
can be neither accurately measured nor directly 
estimated by observation because the observed 
number of species is a downward-biased estimator 
for the complete (total) species richness of a local 
assemblage. Hundreds of papers describe statistical 
methods for correcting this bias in the estimation 
°f species richness (see also Chapter 3), and spe¬ 
cial protocols and methods have been developed 
for estimating species richness for particular taxa 
( e 'g- Agosti et al. 2000). Nevertheless, many recent 


studies continue to ignore some of the fundamental 
sampling and measurement problems that can com¬ 
promise the accurate estimation of species richness 
(Gotelli & Colwell 2001). 

In this chapter we review the basic statisti¬ 
cal issues involved with species richness estima¬ 
tion. Although a complete review of the subject is 
beyond the scope of this chapter, we highlight sam¬ 
pling models for species richness that account for 
undersampling bias by adjusting or controlling for 
differences in the number of individuals and the 
number of samples collected (rarefaction) as well as 
models that use abundance or incidence distribu¬ 
tions to estimate the number of undetected species 
(estimators of asymptotic richness). 

4.2 State of the field 

4.2.1 Sampling models for biodiversity data 

Although the methods of estimating species rich¬ 
ness that we discuss can be applied to assemblages 
of organisms that have been identified by genotype 
(e.g. Hughes et al. 2000), to species, or to some 
higher taxonomic rank, such as genus or family (e.g. 
Bush & Bambach 2004), we will write 'species' to 
keep it simple. Because we are discussing estima¬ 
tion of species richness, we assume that one or more 
samples have been taken, by collection or observa¬ 
tion, from one or more assemblages for some speci¬ 
fied group or groups of organisms. We distinguish 
two kinds of data used in richness studies: (1) inci¬ 
dence data, in which each species detected in a sam¬ 
ple from an assemblage is simply noted as being 
present, and (2) abundance data, in which the abun¬ 
dance of each species is tallied within each sample. 
Of course, abundance data can always be converted 
to incidence data, but not the reverse. 
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Box 4.1 Observed and estimated richness 


Sobs is the total number of species observed in a sample, or 
in a set of samples. 

S est is the estimated number of species in the 
assemblage represented by the sample, or by the set of 
samples, where est is replaced by the name of an estimator. 

Abundance data. Let 4 be the number of species each 
represented by exactly k individuals in a single sample. 

Thus, f 0 is the number of undetected species (species 
present in the assemblage but not included in the sample), 
4 is the number of singleton species, f 2 is the number of 
doubleton species, etc. The total number of individuals in 
Sobs 

the sample is n - 4 - 

*=i 

Replicated incidence data. Let q k be the number of 
species present in exactly k samples in a set of replicate 
incidence samples. Thus, q 0 is the number of undetected 
species (species present in the assemblage but not included 
in the set of samples), q, is the number of unique species, 
q 2 is the number of duplicate species, etc. The total number 

Sobs 

of samples is m = Y, < 7 '*• 

*=i 


Chao 1 (for abundance data) 


f2 

$chao-\ = Sobs + 2 % is the classic form, but is not defined 
when 4=0 (no doubletons). 

Sc/ao, = S 0 bs + f 2 {f' 2 +') is a bias-corrected form, always 
obtainable. 


var(5 c „ aol ) = f 2 I (A ) 2 + (A)' + 1 (-|) 4 


for 


4 > 0 and 4 > 0 (see Colwell 2009, Appendix B of 
Estimates User's Guide for other cases and for asymmetrical 
confidence interval computation). 


Chao 2 (for replicated incidence data) 


Schao 2 = Sobs + ^ is the classic form, but is not defined 
when q 2 = 0 (no duplicates). 

Schao 2 = Sobs + { 02 ~) 1^7 is a bias-corrected form, 
always obtainable. 


«ar(W) = Jl( J ) 2 + (J) S + 1 (J) 4 ' 


for 


q, > 0 and q 2 > 0 (see Colwell 2009, Appendix B of 
Estimates User's Guide for other cases and for asymmetrical 
confidence interval computation). 


ACE (for abundance data) 


10 

Srare = £ 4 is the number of rare species in a sample (each 
* =1 

with 10 or fewer individuals). 

Sobs 

Sabund = £ 4 is the number of abundant species in a 
*=n 

sample (each with more than 10 individuals). 

10 

n ra re = X! ^4 is the total number of individuals in the 

*=i 

rare species. 

The sample coverage estimate is C AC e = 1 - the 
proportion of all individuals in rare species that are not 
singletons. Then the ACE estimator of species richness is 
Sacs = S abund + + c£jy 2 AC E’ wh e r e vice 'S the 

coefficient of variation, 


Yace = max 


10 

EW- D4 

Srare k= 1 _ 

Gace (n rare ) (tl rare 1 ) 


- 1,0 


The formula for ACE is undefined when all rare species 
are singletons (4 = n raw , yielding C ACE = 0). In this case, 
compute the bias-corrected form of Chaol instead. 


ICE (for incidence data) 


10 

S in fr = J2 c lk is the number of infrequent species in a 

*=i 

sample (each found in 10 or fewer samples). 

Sobs 

S/req = X) q* is the number of frequent species in a 
*=n 

sample (each found in more than 10 samples). 

10 

n m f r = J2 kQk is the total number of incidences in the 

*=i 

infrequent species. 

The sample coverage estimate is C,c* = 1 - the 
proportion of all incidences of infrequent species that are 
not uniques. Then the ICE estimator of species richness is 

Cice = Sfreq + ^ + ^y} C E’ where yJ C e is the coefficient 
of variation, 


yJce~ max 


Sinfr 


10 

EW- 

ffhnfr k=] 


Vqk 


ClCE (Winfr 1 ) ( Pinfr) 


1,0 
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The formula for ICE is undefined when all infrequent 
species are uniques (g, = r)j n f r , yielding Co = 0). In this 
case, compute the bias-corrected form of Chao2 
instead. 


Jackknife estimators (for abundance data) 

The first-order jackknife richness estimator is 

$ jackknife 1 — $obs + 6 

The second-order jackknife richness estimator is 

Sjackknife2 — $obs + 2p — f 2 


Jackknife estimators (for incidence data) 

The first-order jackknife richness estimator is 

$ jackknife! = ^obs + Q 1 

The second-order jackknife richness estimator is 



-^jackknife! 


- 5 0 bs + 


g, (2m - 3) 


m 


<h ( m - 2) 2 
m(m - 1 ) 


By their nature, sampling data document only 
the verified presence of species in samples. The 
absence of a particular species in a sample may 
represent either a true absence (the species is not 
present in the assemblage) or a false absence (the 
species is present, but was not detected in the 
sample; see Chapter 3). Although the term 'pres¬ 
ence/absence data' is often used as a synonym for 
incidence data, the importance of distinguishing 
true absences from false ones (not only for rich¬ 
ness estimation, but in modelling contexts, e.g. Elith 
et al. 2006) leads us to emphasize that incidence 
data are actually 'presence data'. Richness esti¬ 
mation methods for abundance data assume that 
organisms can be sampled and identified as dis¬ 
tinct individuals. For clonal and colonial organisms, 
such as many species of grasses and corals, indi¬ 
viduals cannot always be separated or counted, but 
methods designed for incidence data can nonethe¬ 
less be used if species presence is recorded within 
standardized quadrats or samples (e.g. Butler & 
Chazdon 1998). 

Snacking from a jar of mixed jellybeans provides 
a good analogy for biodiversity sampling (Longino 
et al. 2002). Each jellybean represents a single indi¬ 
vidual, and the different colours represent the dif¬ 
ferent species in the jellybean 'assemblage'—in a 
fypical sample, some colours are common, but most 
are rare. Collecting a sample of biodiversity data 
is equivalent to taking a small handful of jelly¬ 
beans from the jar and examining them one by 
°ne. From this incomplete sample, we try to make 


inferences about the number of colours (species) in 
the entire jar. This process of statistical inference 
depends critically on the biological assumption that 
the community is 'closed,' with an unchanging total 
number of species and a steady species abundance 
distribution. Jellybeans may be added or removed 
from the jar, but the proportional representation of 
colours is assumed to remain the same. In an open 
metacommunity, in which the assemblage changes 
size and composition through time, it may not be 
possible to draw valid inferences about community 
structure from a snapshot sample at one point in 
time (Magurran 2007). Few, if any, real communities 
are completely 'closed', but many are sufficiently 
circumscribed that that richness estimators may be 
used, but with caution and caveats. 

For all of the methods and metrics (Box 4.1) that 
we discuss in this chapter, we make the closely 
related statistical assumption that sampling is with 
replacement. In terms of collecting inventory data 
from nature, this assumption means either that indi¬ 
viduals are recorded, but not removed, from the 
assemblage (e.g. censusing trees in a plot) or, if 
they are removed, the proportions remaining are 
unchanged by the sampling. 

This framework of sampling, counting, and iden¬ 
tifying individuals applies not only to richness esti¬ 
mation, but also to many other questions in the 
study of biodiversity, including the characterization 
of the species abundance distribution (see Chap¬ 
ter 9) and partitioning diversity into a and |3 com¬ 
ponents (see Chapters 6 and 7). 
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Figure 4.1 Species accumulation and rarefaction curves. The 
jagged line is the species accumulation curve for one of many 
possible orderings of 121 soil seedbank samples, yielding a total 
of 952 individual tree seedlings, from an intensive census of a plot 
of Costa Rican rainforest (Butler & Chazdon 1998). The cumulative 
number of tree species (/-axis) is plotted as a function of the 
cumulative number of samples (upper x-axis), pooled in random 
order. The smooth, solid line is the sample-based rarefaction curve 
for the same data set, showing the mean number of species for all 

possible combinations of 1, 2 . rrf . 121 actual 

samples from the dataset—this curve plots the statistical 
expectation of the (sample-based) species accumulation curve. 

The dashed line is the individual-based rarefaction curve for the 
same data set—the expected number of species for 
(m*) (952/121) individuals, randomly chosen from all 952 
individuals (lower x-axis). The black dot indicates the total 
richness for all samples (or all individuals) pooled. The 
sample-based rarefaction curve lies below the individual-based 
rarefaction curve because of spatial aggregation within species. 
This is a very typical pattern for empirical comparisons of 
sample-based and individual-based rarefaction curves. 
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4.2.2 The species accumulation curve 

Consider a graph in which the x-axis is the num¬ 
ber of individuals sampled and the y-axis is the 
cumulative number of species recorded (Fig. 4.1, 
lower x-axis). Imagine taking one jellybean at a time 
from the jar, at random. As more individuals (jelly¬ 
beans) are sampled, the total number of species 
(colours) recorded in the sample increases, and a 
species accumulation curve is generated. Of course, 
the first individual drawn will represent exactly one 
species new to the sample, so all species accumu¬ 
lation curves based on individual organisms origi¬ 
nate at the point [1,1]. The next individual drawn 
will represent either the same species or a species 
new to the sample. The probability of drawing a 
new species will depend both on the complete num¬ 
ber of species in the assemblage and their relative 
abundances. The more species in the assemblage 
and the more even the species abundance distribu¬ 
tion (see Chapter 9), the more rapidly this curve will 
rise. In contrast, if the species abundance distribu¬ 
tion is highly uneven (a few common species and 
many rare ones, for example), the curve will rise 
more slowly, even at the outset, because most of the 
individuals sampled will represent more common 
species that have already been added to the sample, 
rather than rarer ones that have yet to be detected. 


Regardless of the species abundance distribu¬ 
tion, this curve increases monotonically, with a 
decelerating slope. For a given sample, different 
stochastic realizations of the order in which the 
individuals in the sample are added to the graph 
will produce species accumulation curves that dif¬ 
fer slightly from one another. The smoothed aver¬ 
age of these individual curves represents the sta¬ 
tistical expectation of the species accumulation 
curve for that particular sample, and the variabil¬ 
ity among the different orderings is reflected in 
the variance in the number of species recorded for 
any given number of individuals. However, this 
variance is specific, or conditional, on the particu¬ 
lar sample that we have drawn because it is based 
only on re-orderings of that single sample. Suppose, 
instead, we plot the smoothed average of several 
species accumulation curves, each based on a dif¬ 
ferent handful of jellybeans from the same jar, each 
handful having the same number of beans. Varia¬ 
tion among these smoothed curves from the several 
independent, random samples represents another 
source of variation in richness, for a given number 
of individuals. The variance among these curves is 
called an unconditional variance because it estimates 
the true variance in richness of the assemblage. The 
unconditional variance in richness is necessarily 
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larger than the variance conditional on any single 
sample. 

4.2.3 Climbing the species accumulation 
curve 

In theory, finding out how many species character¬ 
ize an assemblage means sampling more and more 
individuals until no new species are found and the 
species accumulation curve reaches an asymptote. 
In practice, this approach is routinely impossible for 
two reasons. First, the number of individuals that 
must be sampled to reach an asymptote can often be 
prohibitively large (Chao et al. 2009). The problem 
is most severe in the tropics, where species diversity 
is high and most species are rare. For example, after 
nearly 30 consecutive years of sampling, an ongo¬ 
ing inventory of a tropical rainforest ant assemblage 
at La Selva, Costa Rica, has still not reached an 
asymptote in species richness. Each year, one or two 
new species are added to the local list. In some cases 
these species are already known from collections at 
other localities, but in other cases they are new to 
science (Longino et al. 2002). In other words, bio¬ 
diversity samples, even very extensive ones, often 
fall short of revealing the complete species richness 
for an assemblage, representing some unspecified 
milestone along a slowly rising species accumula¬ 
tion curve with an unknown destination. 

A second reason that the species accumula¬ 
tion curve cannot be used to directly determine 
species richness is that, in field sampling, ecolo¬ 
gists almost never collect random individuals in 
sequence. Instead, individual plants or mobile ani¬ 
mals are often recorded from transects or points 
counts, or individual organisms are collected in pit- 
fall and bait traps, sweep samples, nets, plankton 
tows, water, soil, and leaf litter samples, and other 
taxon-specific sampling units that capture multi¬ 
ple individuals (Southwood & Henderson 2000). 
Although these samples can, under appropriate 
circumstances, be treated as independent of one 
another, the individuals accumulated within a sin¬ 
gle sample do not represent independent observa¬ 
tions. Although individuals contain the biodiver¬ 
sity information' (species identity), it is the sam¬ 
ples that represent the statistically independent 
replicates for analysis. When spatial and temporal 


autocorrelation is taken into account, the samples 
themselves may be only partially independent. 
Nevertheless, the inevitable non-independence of 
individuals within samples can be overcome by 
plotting a second kind of species accumulation 
curve, called a sample-based species accumulation 
curve, in which the x-axis is the number of samples 
and the y-axis is the accumulated number of species 
(Fig. 4.1, upper x-axis). Because only the identity 
but not the number of individuals of each species 
represented within a sample is needed to construct 
a sample-based species accumulation curve, these 
curves plot incidence data. This approach is there¬ 
fore also suitable for clonal and colonial species that 
cannot be counted as discrete individuals. 


4.2.4 Species richness versus species density 

The observed number of species recorded in a sam¬ 
ple (or a set of samples) is very sensitive to the 
number of individuals or samples observed or col¬ 
lected, which in turn is influenced by the effec¬ 
tive area that is sampled and, in replicated designs, 
by the spatial arrangement of the replicates. Thus, 
many measures reported as 'species richness' are 
effectively measures of species density: the number 
of species collected in a particular total area. For 
quadrat samples or other methods that sample a 
fixed area, species density is expressed in units of 
species per specified area. Even for traps that col¬ 
lect individuals at a single point (such as a pitfall 
trap), there is probably an effective sampling area 
that is encompassed by data collection at a single 
point. 

Whenever sampling is involved, species density 
is a slippery concept that is often misused and 
misunderstood. The problem arises from the non¬ 
linearity of the species accumulation curve. Con¬ 
sider the species accumulation curve for rainforest 
seedlings (Butler & Chazdon 1998) in Fig. 4.2, which 
plots the species of seedlings grown from dormant 
seed in 121 soil samples, each covering a soil surface 
area of 17.35 cm 2 and a depth of 10 cm. The x-axis 
plots the cumulative surface area of soil sampled. 
The slopes of lines A, B, and C represent species 
density: number of species observed (y), divided by 
area-sampled (x). You can see that species density 



44 BIOLOGICAL DIVERSITY 


Figure 4.2 Species richness and species density are not the same 
thing. The solid line is the sample-based rarefaction curve for the 
same data set as in Fig 4.1, showing the expected species richness of 

rainforest tree seedlings for 1, 2. rrf, .... 121 soil samples, 

each covering a soil surface area of 17.35 cm 2 and a depth of 
10 cm. Species richness (y-axis) is plotted as a function of the total 
soil surface area sampled (x-axis). Because species density is the ratio 
of richness (/-coordinate) to area (x-coordinate) for any point in the 
graph, the slopes of lines A, B, and C quantify species density for 
500, 1000, and 2000cm 2 , respectively. Clearly, species density 
estimates depend on the particular amount of area sampled. All of 
the species density slopes over-estimate species number when 
extrapolated to larger areas, and species density estimates based on 
differing areas are not comparable. 


depends critically not just on area, but on the spe¬ 
cific amount of area sampled. For this reason, it 
never works to 'standardize' the species richness 
of samples from two or more assemblages by sim¬ 
ply dividing observed richness by area sampled (or 
by any other measure of effort, including number 
of individuals or number of samples). Estimating 
species density by calculating the ratio of species 
richness to area sampled will always grossly over¬ 
estimate species density when this index is extrap¬ 
olated to larger areas, and the size of that bias will 
depend on the area sampled. 

Sometimes, however, ecologists or conservation 
biologists are interested in species density, for some 
particular amount of area, in its own right. For 
example, if only one of two areas, equal in size and 
cost per hectare, can be purchased to establish a 
reserve, species density at the scale of the reserve is 
clearly a variable of interest. Because specie s density 
is so sensitive to area (and, ultimately, to the num¬ 
ber of individuals observed or collected), it is useful 
to decompose it into the product of two quanti¬ 
ties: species richness (number of species represented 
by some particular number, N, of individuals) and 
total individual density (number of individuals N, 
disregarding species, in some particular amount of 
area A): 

species\ _ / species \ f N individuals\ 

area a) \N individuals) 1 area A ) 



(James & Warner 1982). This decomposition demon¬ 
strates that the number of species per sampling unit 
reflects both the underlying species richness and 
the total number of individuals sampled. If two 
samples differ in species density, is it because of 
differences in underlying species richness, differ¬ 
ences in abundance, or some combination of both? 
In other words, how do we meaningfully com¬ 
pare the species richness of collections that prob¬ 
ably differ in both the number of individuals and 
the number of samples collected? Until recently, 
many ecologists have not recognized this prob¬ 
lem. The distinction between species density and 
species richness has not always been appreciated, 
and many papers have compared species density 
using standard parametric statistics, but without 
accounting for differences in abundance or sam¬ 
pling effort. 

One statistical solution is to treat abundance, 
number of samples, or sample area as a covariate 
that can be entered into a multiple regression analy¬ 
sis or an analysis of covariance. If the original data 
(counts and identities of individuals) are not avail¬ 
able, this may be the best that we can do. For exam¬ 
ple, Dunn et al. (2009) assembled a global database 
of ant species richness from a number of published 
studies. To control for sampling effects, they used 
the area, number of samples, and total number of 
individuals from each sample location as statisti¬ 
cal covariates in regression analyses. However, they 
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did not make the mistake of trying to 'standard¬ 
ize' the richness of different samples by dividing 
the species counts by the area, the number of indi¬ 
viduals sampled, or any other measure of effort. 
As we have repeatedly emphasized, this rescaling 
produces serious distortions: extrapolations from 
small sample ratios of species density inevitably 
lead to gross over-estimates of the number of 
species expected in larger sample areas (Fig. 4.2 and 
Figure 4-6 in Gotelli & Colwell 2001). 

4.2.5 Individual-based rarefaction 

The species accumulation curve itself suggests an 
intuitive way to compare the richness of two sam¬ 
ples (for the same kind of organism) that differ in 
the number of individuals collected. Suppose one 
of the two samples has N individuals and S species, 
and the other has n individuals and s species. The 
samples differ in the number of individuals present 
(N > n) and will usually differ in the number of 
species present (typically S > s). In the procedure 
called rarefaction, we randomly draw n* individuals, 
subsampling without replacement from the larger 
of the two original samples, where n* = n, the size 
of the smaller original sample. (This re-sampling, 
without replacement, of individuals from within 
the sample does not violate the assumption that the 
process of taking the sample itself did not change 
the relative abundance of species). Computing the 
mean number of species, s’, among repeated sub¬ 
samples of n* individuals estimates E(s’|n*), the 
expected number of species in a random subsam¬ 
ple of n* individuals from the larger original sam¬ 
ple (Fig. 4.1, lower x-axis). The variance of (s’), 
among random re-orderings of individuals, can also 
be estimated this way along with a parametric 95% 
confidence interval, or the confidence interval can 
be estimated from the bootstrapped values (Manly 
1991). 

A simple test can now be conducted to ask 
whether s, the observed species richness of the com¬ 
plete smaller sample, falls within the 95% confi¬ 
dence interval of s’, the expected species richness 
based on random subsamples of size n from the 
larger sample (Simberloff 1978). If the observed 
value falls within the confidence interval, then the 
hypothesis that the richness of the smaller sample. 


based on all n individuals, does not differ from the 
richness of a subsample of size n* from the larger 
sample cannot be rejected at P < 0.05. If this null 
hypothesis is not rejected, and the original, unrar¬ 
efied samples differed in species density, then this 
difference in species density must be driven by 
differing numbers of individuals between the two 
samples. Alternatively, if s is not contained within 
the confidence interval of s’, the two samples differ 
in species richness in ways that cannot be accounted 
for entirely by differences in abundance and/or 
sampling effort (at P < 0.05). 

Rarefaction can be used not only to calculate a 
point estimate of s’, but also to construct an entire 
rarefaction curve in which the number of individuals 
randomly subsampled ranges from 1 to N. Rarefac¬ 
tion can be thought of as a method of interpolating 
£(s*|h*) the expected number of species, given n * 
individuals (1 < n* < N), between the point [1, 1] 
and the point [S, N] (Colwell et al. 2004). With pro¬ 
gressively smaller subsamples from N - 1 to 1, the 
resulting individual-based rarefaction curve, in a sense, 
is the reverse of the corresponding species accumu¬ 
lation curve, which progressively builds larger and 
larger samples. 

Because this individual-based rarefaction curve 
is conditional on one particular sample, the vari¬ 
ance in s’, among random re-orderings of indi¬ 
viduals, is 0 at both extremes of the curve: with 
the minimum of only one individual there will 
always be only one species represented, and with 
the maximum of N individuals, there will always be 
exactly S species represented. Hurlbert (1971) and 
Heck et al. (1975) give analytical solutions for the 
expectation and the conditional variance of s’, which 
are derived from the hypergeometric distribution. 
In contrast, treating the sample (one handful of 
jellybeans) as representative of a larger assemblage 
(the jar of jellybeans) requires an estimate of the 
unconditional variance (the variance in s*\n* among 
replicate handfuls of jellybeans from the same jar). 
The unconditional variance in richness, S, for the 
full sample of N individuals, must be greater than 
zero to account for the heterogeneity that would 
be expected with additional random samples of 
the same size taken from the entire assemblage. 
Although Smith & Grassle (1977) derived an esti¬ 
mator for the unconditional variance of E(s*\n*), 
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it is computationally complex and has been little 
used. R.K. Colwell and C.X. Mao (in preparation) 
have recently derived an unconditional variance 
estimator for individual-based rarefaction that is 
analogous to the unconditional variance estimator 
for sample-based rarefaction described in Colwell 
et al. (2004), and discussed below. 

Regardless of how the variance is estimated, the 
statistical significance of the difference in rarefied 
species richness between two samples will depend, 
in part, on n, the number of individuals being com¬ 
pared. This sample-size dependence arises because 
all rarefaction curves based on individuals con¬ 
verge at the point [1,1], Therefore, no matter how 
different two assemblages are, rarefaction curves 
based on samples of individuals drawn at ran¬ 
dom will not appear to differ statistically if n is 
too small. In some cases, rarefaction curves may 
cross at higher values of n, making the results of 
statistical tests even more dependent on n (e.g. 
Raup 1975). 

To compare multiple samples, each can be rar¬ 
efied down to a common abundance, which will 
typically be the total abundance for the smallest of 
the samples. At that point, the set of s* values, one 
for each sample, can be used as a response variable 
in any kind of statistical analysis, such as ANOVA 
or regression. This method assumes that the rarefac¬ 
tion curves do not cross (which may be assessed 
visually), so that their rank order remains the same 
regardless of the abundance level used. Alterna¬ 
tively, multiple samples from the same assemblage 
can be used in a sample-based rarefaction, which we 
describe below. 

Rarefaction has a long history in ecology and evo¬ 
lution (Sanders 1968; Hurlbert 1971; Raup 1975; Tip¬ 
per 1979; Jarvinen 1982; Chiarucci et al. 2008).The 
method was proposed in the 1960s and 1970s to 
compare species number when samples differed 
in abundance (Tipper 1979), but the same statisti¬ 
cal problem had been solved many decades ear¬ 
lier by biogeographers who wanted to estimate 
species/genus ratios and other taxonomic diversity 
indices (Jarvinen 1982). 

Brewer & Williamson (1994) and Colwell & Cod- 
dington (1994) pointed out that a very close approx¬ 
imation for the rarefaction curve is the Coleman 
'passive sampling' curve. 


s 

E (s*) = Y2 [l — (1 — m*/N)"' ]. (4-1) 

i=i 

in which i indexes species from 1 to S, and n, is the 
abundance of species i in the full sample. As a null 
model for the species-area relationship (see Chap¬ 
ter 20), the Coleman curve assumes that islands of 
different area randomly intercept individuals and 
accumulate different numbers of species (Coleman 
et al. 1982). The individual-based rarefaction curve 
is very closely analogous to the Coleman curve 
(and, although mathematically distinct, differs only 
slightly from it) because relative island area is a 
proxy for the proportion n* /N of individuals sub¬ 
sampled from the pooled distribution of all individ¬ 
uals in the original sample (Gotelli 2008). 

4.2.6 Sample-based rarefaction 

Individual-based rarefaction computes the 
expected number of species, s*, in a subsample 
of n* individuals drawn at random from a single 
representative sample from an assemblage. In 
contrast, sample-based rarefaction computes the 
expected number of species s* when m* samples 
(1 < m* < M) are drawn at random (without 
replacement) from a set of samples that are, 
collectively, representative of an assemblage 
(Fig. 4.1, upper x-axis) (Gotelli & Colwell 2001; 
Colwell et al. 2004). (This re-sampling, without 
replacement, of samples from within the sample 
set does not violate the assumption that the process 
of taking the sample itself did not change the 
relative abundance of species.) The fundamental 
difference is that sample-based rarefaction, by 
design, preserves the spatial structure of the 
data, which may reflect processes such as spatial 
aggregation or segregation (see Chapter 12) 
both within and between species. In contrast, 
individual-based rarefaction does not preserve the 
spatial structure of the data and assumes complete 
random mixing among individuals of all species. 
Thus, for sample-based rarefaction, E (s*|m*) is the 
expected number of species for m* pooled samples 
that express the same patterns of aggregation, 
association, or segregation as the observed set of 
samples. For this reason, sample-based rarefaction 
is a more realistic treatment of the independent 
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sampling units used in most biodiversity studies. 
Because sample-based rarefaction requires only 
incidence data, it can also be used for clonal 
organisms or for species in which individuals 
in a sample cannot be easily distinguished or 
counted. 

Operationally, sample-based rarefaction can be 
carried out by repeatedly selecting and pooling 
m* samples at random from the set of samples, 
and computing the mean and conditional (on the 
particular set of samples) variance and 95% confi¬ 
dence interval for s*. On the other hand, E (s*|m*) 
is more easily and accurately computed from com¬ 
binatorial equations based on the distribution of 
counts, the number of species found in exactly 1, 
2, ..., m * samples in the set (Ugland et al. 2003; 
Colwell et al. 2004; see Chiarucci et al. 2008 for 
a history of this approach). Colwell et al. 2004 
also introduced a sample-based version of the 
Coleman rarefaction model, the results of which 
closely approximate the true sample-based rarefac¬ 
tion curve. 

Ugland et al. (2003) provide an expression for 
the conditional variance in richness estimates from 
sample-based rarefaction. Colwell et al. (2004) 
derived an unconditional variance estimator for 
sample-based rarefaction that treats the observed 
set of samples, in turn, as a sample from some 
larger assemblage, so that the variance in S for 
all M samples, pooled (the full set of samples), 
takes some non-zero value. This unconditional vari¬ 
ance (and its associated confidence interval (Cl)) 
accounts for the variability expected among repli¬ 
cate sets of samples. Based on unconditional vari¬ 
ances for two sample-based rarefaction curves, rich¬ 
ness can be compared for any common number of 
samples (or individuals, as explained below). Using 
eigenvalue decomposition, Mao & Li (2009) devel¬ 
oped a computationally complex method for com¬ 
paring two sample-based rarefaction curves in their 
entirety. A much simpler, but approximate, method 
is to assess, for a desired value of m*, whether or 
not the two (appropriately computed) confidence 
intervals overlap. If the two CIs (calculated from the 
unconditional variance) are approximately equal, 
for a type I error rate of P < 0.05, the appropriate 
Cl is about 84% (Payton et al. 2003; the z value 
for 84% Cl is 0.994 standard deviations). Basing the 


test on the overlap of traditional 95% CIs is overly 
conservative: richness values that would differ sig¬ 
nificantly with the 84% interval would often be 
declared statistically indistinguishable because the 
95% intervals for the same pair of samples would 
overlap (Payton et al. 2003). 

An important pitfall to avoid in using sample- 
based rarefaction to compare richness between 
sample sets is that the method does not directly con¬ 
trol for differences in overall abundance between 
sets of samples. Suppose two sets of samples are 
recorded from the same assemblage, but they dif¬ 
fer in mean number of individuals per sample 
(systematically or by chance). When plotted as a 
function of number of samples (on the x-axis) the 
sample-based rarefaction curve for the sample set 
with a higher mean abundance per sample will lie 
above the curve for the sample set with lower mean 
abundance because more individuals reveal more 
species. The solution suggested by Gotelli & Col¬ 
well (2001) is to first calculate sample-based rarefac¬ 
tion curves and their variances (or CIs) for each set 
of samples in the analysis. Next, the curves are re¬ 
plotted against an x-axis of individual abundance, 
rather than number of samples. This re-plotting 
effectively shifts the points of each individual-based 
rarefaction curve to the left or the right, depending 
on the average number of individuals that were 
collected in each sample. Ellison et al. (2007) used 
this method to compare the efficacy of ant sam¬ 
pling methods that differed greatly in the average 
number of individuals per sample (e.g. 2 ants per 
pitfall trap, versus > 89 ants per plot for standard¬ 
ized hand sampling). Note that if sample-based rar¬ 
efaction is based on species occurrences rather than 
abundances, then the rescaled x-axis is the number 
of species occurrences, not the number of individu¬ 
als. 

4.2.7 Assumptions of rarefaction 

To use rarefaction to compare species richness of 
two (or more) samples or assemblages rigorously, 
the following assumptions should be met: 

1. Sufficient sampling. As with any other statis¬ 
tical procedure, the power to detect a dif¬ 
ference, if there is one, depends on having 
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large enough individuals or samples, especially 
since rarefactions curves necessarily converge 
towards the origin. Although it is difficult to give 
specific recommendations, our experience has 
been that rarefaction curves should be based on 
at least 20 individuals (individual-based rarefac¬ 
tion) or 20 samples (sample-based rarefaction), 
and preferably many more. 

2. Comparable sampling methods. Because all sam¬ 
pling methods have inherent and usually 
unknown sampling biases that favour detection 
of some species but not others (see Chapter 3), 
rarefaction cannot be used to compare data from 
two different assemblages that were collected 
with two different methods (e.g. bait samples vs 
pitfall traps, mist-netting vs point-sampling for 
birds). However, rarefaction can be used mean¬ 
ingfully to compare the efficacy of different sam¬ 
pling methods that are used in the same area 
(Longino et al. 2002; Ellison et al. 2007). Also, 
data from different sampling methods may be 
pooled in order to maximize the kinds of species 
that may be sampled with different sampling 
methods (e.g. ants in Colwell et al. (2008)). How¬ 
ever, identical sampling and pooling procedures 
must to be employed to compare two composite 
collections. 

3. Taxonomic similarity. The assemblages repre¬ 
sented by the two samples should be taxonom- 
ically 'similar'. In other words, if two samples 
that differ in abundance but have rarefaction 
curves with identical shapes do not share any 
taxa, we would not want to conclude that the 
smaller collection is a random subsample of the 
larger (Tipper 1979). Rarefaction seems most use¬ 
ful when the species composition of the smaller 
sample appears to be a nested or partially nested 
subset of the larger collection. Much more pow¬ 
erful methods are now available to test directly 
for differences in species composition (Chao 
et al. 2005). 

4. Closed communities of discrete individuals. The 
assemblages being sampled should be well cir¬ 
cumscribed, with consistent membership. Dis¬ 
crete individuals in a single sample must 
be countable (individual-based rarefaction) or 
species presence in multiple samples must be 
detectable (sample-based rarefaction). 


5. Random placement. Individual-based rarefaction 
assumes that the spatial distribution of individ¬ 
uals sampled is random. If individuals within 
species are spatially aggregated, individual- 
based rarefaction will over-estimate species rich¬ 
ness because it assumes that the rare and com¬ 
mon species are perfectly intermixed. Some 
authors have modified the basic rarefaction 
equations to include explicit terms for spatial 
clumping (Kobayashi & Kimura 1994). However, 
this approach is rarely successful because the 
model parameters (such as the constants in the 
negative binomial distribution) cannot be eas¬ 
ily and independently estimated for all of the 
species in the sample. One way to deal with 
aggregation is to increase the distance or timing 
between randomly sampled individuals so that 
patterns of spatial or temporal aggregation are 
not so prominent. An even better approach is to 
use sample-based rarefaction, again employing 
sampling areas that are large enough to over¬ 
come small-scale aggregation. 

6. Independent, random sampling. Individuals or 
samples should be collected randomly and 
independently. Both the individual-based and 
sample-based methods described in this chap¬ 
ter assume that sampling, from nature, does not 
affect the relative abundance of species (statis¬ 
tically, sampling with replacement). However, 
if the sample is relatively small compared to 
the size of the underlying assemblage (which is 
often the case), the results should be similar for 
samples collected with or without replacement. 
More work is needed to derive estimators that 
can be used for sampling without replacement, 
which will be important for cases in which the 
sample represents a large fraction of the total 
assemblage. Unfortunately, as we have noted 
earlier, biodiversity data rarely consist of col¬ 
lections of individuals that were sampled ran¬ 
domly. Instead, the data often consist of a series 
of random and approximately independent sam¬ 
ples that contain multiple individuals. 

4.2.8 Estimating asymptotic species richness 

Consider the species richness of a single biodiver¬ 
sity sample (or the pooled richness of a set of sam- 
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pies) as the starting point in a graph of richness 
versus abundance or sample number (the dot at the 
right-hand end of the curves in Fig. 4.1). Rarefac¬ 
tion amounts to interpolating 'backward' from the 
endpoint of a species accumulation curve, yielding 
estimates of species richness expected for smaller 
numbers of individuals or samples. In contrast, 
using this starting point to estimate the complete 
richness of the assemblage, including species that 
were not detected by the sample, can be visualized 
as extrapolating 'forward' along a hypothetical pro¬ 
jection the accumulation curve (Colwell et al. 2004, 
their Figure 4). Two objectives of extrapolation can 
be distinguished: (1) estimating the richness of a 
larger sample and (2) estimating the complete rich¬ 
ness of the assemblage, visualized as the asymptote 
of the accumulation curve. Once this asymptote is 
reached, the species accumulation curve is flat and 
additional sampling will not yield any additional 
species. 

Why should the species accumulation curve have 
an asymptote? On large geographical scales, it does 
not: larger areas accumulate species at a constant 
or even an increasing rate because expanded sam¬ 
pling incorporates diverse habitat types that sup¬ 
port distinctive species assemblages (see Chap¬ 
ter 20). As a consequence, the species accumulation 
curve continues to increase, and will not reach a 
final asymptote until it approaches the total area 
of the biosphere. The subject of species turnover is 
covered by Jost et al. and Magurran (Chapters 6 
and 7) and species-area relationships are the subject of 
Chapter 20. In this chapter, we focus on the estima¬ 
tion of species richness at smaller spatial scales— 
scales at which an asymptote is a reasonable sup¬ 
position and sampling issues are substantially more 
important than spatial turnover on habitat mosaics 
or gradients (Cam et al. 2002). In statistical terms, 
we assume that samples were drawn independently 
and at random from the local assemblage, so that 
the ordering of the samples in time or space is not 
important. In fact, unimportance of sample order is 
diagnostic of the kinds of sample sets appropriately 
used by ecologists to assess local species richness 
(Colwell et al. 2004). 

The most direct approach to estimating the 
species richness asymptote is to fit an asymp¬ 
totic mathematical function (such as the Michaelis- 


Menten function; Keating & Quinn (1998)) to a 
rarefaction or species accumulation curve. This 
approach dates back at least to Holdridge et al. 
(1971), who fitted a negative binomial function to 
smoothed species accumulation curves to compare 
the richness of Costa Rica trees at different local¬ 
ities. Many other asymptotic functions have since 
been explored (reviewed by Colwell & Coddington 
(1994), Flather (1996), Chao (2005), and Rosenzweig 
et al. (2003)). Unfortunately, this strictly phenom¬ 
enological method, despite the advantage that it 
makes no assumptions about sampling schemes or 
species abundance distributions, does not seem to 
work well in practice. Two or more functions may 
fit a dataset equally well, but yield drastically dif¬ 
ferent estimates of asymptotic richness (Soberon & 
Llorente 1993; Chao 2005), and variance estimates 
for the asymptote are necessarily large. Residual 
analysis often reveals that the popular functions 
do not correctly fit the shape of empirical species 
accumulation curves (O'Hara 2005), and this curve¬ 
fitting method consistently performs worse than 
other approaches (Walther & Moore 2005; Walther 
& Morand 2008). For these reasons, we do not rec¬ 
ommend fitting asymptotic mathematical functions 
as a means of estimating complete species richness 
of local assemblages. 

Mixture models, in which species abundance 
or occurrence distributions are modelled as a 
weighted mixture of statistical distributions, offer 
a completely different, non-parametric approach to 
extrapolating an empirical rarefaction curve to a 
larger sample sizes (or a larger set of samples) 
(reviewed by Mao et al. (2005), Mao & Colwell 
(2005), and Chao (2005)). Colwell et al. (2004), for 
example, modelled the sample-based rarefaction 
curve as a binomial mixture model. However, these 
models are effective only for a doubling or tripling 
of the observed sample size. Beyond this point, the 
variance of the richness estimate increases rapidly. 
Unless the initial sample size is very large, pro¬ 
jecting the curve to an asymptotic value usually 
requires much more than a doubling or tripling of 
the initial sample size (Chao et al. 2009), so this 
method is not always feasible, especially for hyper- 
diverse taxa (Mao & Colwell 2005). 

Another classical approach to estimating asymp¬ 
totic richness is to fit a species abundance 
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Figure 4.3 Estimation of asymptotic species richness by fitting a 
log-normal distribution to a species abundance distribution. The graph 
shows the number of species of ants in each of seven 
logarithmically-scaled abundance categories (a total of 435 species 
collected) in a long-term rainforest inventory in Costa Rica (Longino 
et al. 2002). The number of undetected species (21 additional species) is 
estimated by the area marked with horizontal hatching, yielding a 
predicted complete richness of 456 species. 


distribution (see Chapter 9), based on a single sam¬ 
ple, to a truncated parametric distribution, then 
estimate the 'missing' portion of the distribution, 
which corresponds to the undetected species in 
an assemblage. Fisher et al. (1943) pioneered this 
approach by fitting a geometric series to a large 
sample of moths captured at light traps. Relative 
incidence distributions from replicated sets of sam¬ 
ples can be treated in the same way (Longino et al. 
2002). The most widely used species abundance 
distribution for this approach is the log-normal 
(Fig. 4.3) and its variants (from Preston (1948) to 
Hubbell (2001)), but other distributions (geometric 
series, negative binomial, y, exponential, inverse 
Guassian) have also been used. The challenges of 
fitting the log-normal have been widely discussed 
(e.g. Colwell & Coddington 1994; Chao 2004; Dor- 
nelas et al. 2006; Connolly et al. 2009). One of 
the limitations of this approach is shared with the 
extrapolation of fitted parametric functions: two or 
more species abundance distributions may fit the 
data equally well, but predict quite different assem¬ 
blage richness. In addition, the species abundance 
distribution that fits best may be one that cannot 
be used to estimate undetected species, such as the 
widely used log-series distribution (Chao 2004). 

The limitations of parametric methods inspired 
the development of non-parametric richness esti¬ 
mators, which require no assumptions about an 
underlying species abundance distribution and do 
not require the fitting of either a priori or ad hoc 
models (Chao 2004). These estimators have experi¬ 
enced a meteoric increase injjsage in the past two 
decades, as species richness has become a focus of 
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biodiversity surveys and conservation issues, and a 
subject of basic research on the causes and conse¬ 
quences of species richness in natural ecosystems. 
In Box 4.1, we have listed six of the most widely 
used and best-performing indices. All the estima¬ 
tors in Box 4.1 depend on a fundamental principle 
discovered during World War II by Alan Turing and 
I.J. Good (as reported by Good (1953, 2000)), while 
cracking the military codes of the German Wehrma- 
cht Enigma coding machine: the abundances of the 
very rarest species or their frequencies in a sample 
or set of samples can be used to estimate the fre¬ 
quencies of undetected species. All of the estima¬ 
tors in Box 4.1 correct the observed richness S 0 b s by 
adding a term based on the number of species rep¬ 
resented in a single abundance sample by only one 
individual ( singletons ), by two ( doubletons ), or by a 
few individuals. For incidence data, the added term 
is based on the frequencies of species represented in 
only one ( uniques) sample, in two ( duplicates ), or in 
a few replicate incidence samples. 

Fig. 4.4 shows how well one of these estima¬ 
tors, Chao2, estimates the asymptotic richness of the 
seedbank dataset of Figure 4.1, based on sets of rtf 
samples chosen at random. The estimator stabilizes 
after about 30 samples have been pooled. When all 
121 samples have been pooled, the estimator sug¬ 
gests that 1-2 additional species still remain unde¬ 
tected. 

Only four of the estimators in Box 4.1 (Chaol, 
ACE, and the two individual-based jackknife esti¬ 
mators) are appropriate for abundance data; the 
rest require replicated incidence data. Most of the 
incidence-based estimators were first developed, in 
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Figure 4.4 Asymptotic species richness estimated by the Chao2 
non-parametric richness estimator for the seedbank dataset of 
Fig. 4.1. Plotted values for Chao2 are means of 100 randomizations 
of sample order. The estimator stabilizes after only about 30 samples 
have been pooled. When all 121 samples have been pooled (34 
species detected), the estimator suggests that one or two additional 
species still remain undetected. 
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biological applications, for capture-recapture meth¬ 
ods of population size estimation. The number of 
samples that include Species X in a set of bio¬ 
diversity samples corresponds to the number of 
recaptures of marked Individual X in a capture- 
recapture study. In species richness estimation, the 
full assemblage of species, including those species 
not detected in the set of samples (but susceptible 
to detection), corresponds, in population size esti¬ 
mation, to the total population size, including those 
individuals never captured (but susceptible to cap¬ 
ture) (Boulinier et al. 1998; Chao 2001,2004). 

Behind the disarming simplicity of Chaol and 
Chao2 lies a rigorous body of statistical theory 
demonstrating that both are robust estimators of 
minimum richness (Shen et al. 2003). ACE and ICE 
are based on estimating sample coverage —the pro¬ 
portion of assemblage richness represented by the 
species in a single abundance sample (ACE) or in a 
set of replicated incidence samples (ICE). The esti¬ 
mators are adjusted to the 'spread' of the empirical 
species abundance (or incidence) distribution by a 
coefficient of variation term (Chao 2004). The Chaol 
and Chao2 estimators also provide a heuristic, intu¬ 
itive 'stopping rule' for biodiversity sampling: no 
additional species are expected to be found when all 
species in the sample are represented by at least two 
individuals (or samples). Extending this approach, 
Chao et al. (2009) provide equations and simple 
spreadsheet software for calculating how many 


additional individuals would be needed to sample 
100% (or any other percentage) of the asymptotic 
species richness of a region based on the samples 
already in hand. Pan et al. (2009) have recently 
extended the Chaol and Chao2 indices to provide 
an estimate of the number of shared species in mul¬ 
tiple assemblages. 

The jackknife is a general statistical technique for 
reducing the bias of an estimator by removing sub¬ 
sets of the data and recalculating the estimator with 
the reduced sample. In this application of the tech¬ 
nique, the observed number of species is a biased 
(under-) estimator of the complete assemblage rich¬ 
ness (Burnham & Overton 1979; Heltshe & Forrester 
1983; Chao 2004). For a set of m replicate incidence 
samples, the kth order jackknife reduces the bias by 
estimating richness from all sets of m-k samples. 
The first-order jackknife (Jackknifel) thus depends 
only on the uniques (species found in only one sam¬ 
ple) because the richness estimate is changed only 
when a sample that contains one of these species 
is deleted from a subset of samples. Likewise, the 
second-order jackknife (Jackknife2) depends only 
on the uniques and the duplicates (species found 
in exactly two samples). Similar expressions for 
abundance-based jackknife estimators are based on 
the number of singletons (species represented by 
exactly one individual) and doubletons (species 
represented by exactly two individuals; Burnham & 
Overton (1979)). These estimators can be derived by 
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letting the number of samples m tend to infinity in 
the equations for the incidence-based estimators. 

4.2.9 Comparing estimators of asymptotic 
species richness 

Given the diversity of asymptotic estimators that 
have been proposed, which one(s) should ecologists 
use with their data? The ideal estimator would be 
unbiased (it neither over- or under-estimates asymp¬ 
totic species richness), precise (replicates samples 
from the same assemblage produce similar esti¬ 
mates), and efficient (a relatively small number of 
individuals or samples is needed). Although there 
are many ways to estimate bias, precision, and 
efficiency (Walther & Moore 2005), none of the 
available estimators meet all these criteria for all 
datasets. Most estimators are biased because they 
chronically under-estimate true diversity (O'Hara 
2005). The Chaol estimator was formally derived 
as a minimum asymptotic estimator (Chao 1984), 
but all of the estimators should be treated as esti¬ 
mating the lower bound on species richness. Esti¬ 
mators of asymptotic species richness are often 
imprecise because they typically have large vari¬ 
ances and confidence intervals, especially for small 
data sets. This imprecision is inevitable because, by 
necessity, these estimators represent an extrapola¬ 
tion beyond the limits of the data. In contrast, rar¬ 
efaction estimators usually have smaller variances 
because they are interpolated within the range of 
the observed data. However, as noted earlier, the 
unconditional variance of richness as estimated by 
rarefaction is always larger than the variance that is 
conditional on a single sample (or set of samples). 
Finally, most estimators are not efficient and often 
exhibit 'sampling creep': the estimated asymptote 
itself increases with sample size, suggesting that the 
sample size is not large enough for the estimate to 
stabilize (e.g. Longino et al. (2002)). 

Two strategies are possible to compare the per¬ 
formance of different estimators. The first strategy 
is to use data from a small area that has been 
exhaustively sampled (or nearly so), and to define 
that assemblage as the sampling universe. As in 
rarefaction, a random subsample of these data can 
then be used to calculate^symptotic estimators 
and compare them to the known richness in the 


plot (a method first suggested by Pielou (1975), 
but popularized by Colwell & Coddington (1994)). 
For example, Butler & Chazdon (1998) collected 
seeds from 121 soils samples from a 1 ha plot, on 
a 10 x 10 m grid in tropical rainforest in Costa 
Rica, yielding 952 individual seedlings represent¬ 
ing a total of 34 tree species (Figure 4.1). Col¬ 
well & Coddington (1994) randomly rarefied these 
data, by repeatedly pooling m* samples (1 <m*< 
M), and found that the Chao2 index (illustrated 
in Fig. 4.4) and the second-order jackknife estima¬ 
tors were least biased for small m*, followed by 
the first-order jackknife and the Michaelis-Menten 
estimator. Walther & Morand (1998) used a similar 
approach with nine parasite data sets and found 
that Chao2 and the first-order jackknife performed 
best. Walther & Moore (2005), using different quan¬ 
titative measures of bias, precision, and accuracy, 
compiled the results of 14 studies that compared 
estimator performance, and concluded that, for 
most data sets, non-parametric estimators (mostly 
the Chao and jackknife estimators) performed bet¬ 
ter than extrapolated asymptotic functions or other 
parametric estimators. 

In a second strategy for comparing diversity esti¬ 
mators, the investigator specifies the true species 
richness, the pattern of relative abundance, and 
the spatial pattern of individuals in a computer- 
simulated landscape. The program then randomly 
samples individuals or plots, just as an ecologist 
would do in a field survey. The estimators are then 
calculated and compared on the basis of their abil¬ 
ity to estimate the 'true' species richness of the 
region. This kind of simulation can also be used 
to explore the effects of spatial aggregation and 
segregation, sampling efficiency, and the size and 
placement of sampling plots. Brose et al. (2003) car¬ 
ried out the most extensive analysis of this kind 
to date. In their analyses, which estimator per¬ 
formed best depended on the relative evenness 
of the rank abundance distribution, the sampling 
intensity, and the true species richness. As in the 
empirical surveys (Walther & Moore 2005), non- 
parametric estimators performed better in these 
model assemblages than extrapolated asymptotic 
curves (parametric estimators based on truncated 
distributions were not considered). One encourag¬ 
ing result was that environmental gradients and 
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spatial autocorrelation (which characterize all bio¬ 
diversity data at some spatial scales) did not have a 
serious effect on the performance of the estimators. 
These results are consistent with the findings of 
Hortal et al. (2006), who aggregated empirical data 
sets at different spatial grains and found that non- 
parametric estimators were not greatly affected by 
the spatial scale of the sampling. 

O'Hara (2005) took a hybrid approach that used 
both empirical data and simulated assemblages. He 
first fit negative binomial and Poisson log-normal 
distributions to two very extensive (but incomplete) 
sets of survey data for moths. He used these fitted 
models to generate sample data for comparing non- 
parametric estimators, parametric estimators, and 
extrapolated asymptotic curves. As in other studies, 
true species richness was greater than predicted by 
the estimators. In each comparison, only one of the 
parametric estimators had a 95% confidence inter¬ 
val that encompassed the true richness. The catch is 
that this method worked well only when the 'cor¬ 
rect' species abundance distribution was used. In 
other words, the investigator would need to know 
ahead of time that the negative binomial, Poisson 
log-normal, or some other distribution was the cor¬ 
rect one to use (which rather defeats the value of 
using non-parametric estimators). Unfortunately, in 
spite of decades of research on this topic, there is 
still no agreement on a general underlying form of 
the species abundance distribution, and there are 
difficult issues in the fitting and estimation of these 
distributions from species abundance data (see 
Chapter 10). We hope that future work may lead to 
better species richness estimators. At this time, the 
non-parametric estimators still give the best perfor¬ 
mance in empirical comparisons, and they are also 
simple, intuitive, and relatively easy to use. 

4.2.10 Software for estimating species 
richness from sample data 

Free software packages with tools for estimating 
species richness from sample data include: 

• Estimates (Colwell 2009): http://purl.oclc.org/ 
estimates 

• EcoSim (Gotelli & Entsminger 2009): http:// 
garyentsminger.com/ecosim / index.htm 


• SPADE: http://chao.stat.nthu.edu.tw/software 
CE.html 

• VEGAN (for R): http://cc.oulu.fi/~jarioksa/ 
softhelp/ vegan.html. 

4.3 Prospectus 

Estimates of species richness require special sta¬ 
tistical procedures to account for differences in 
sampling effort and abundance. For comparing 
species richness among different assemblages, we 
recommend sample-based rarefaction using uncon¬ 
ditional variances, with adjustments for the number 
of individuals sampled. Rarefaction methods for 
data that represent sampling from nature with¬ 
out replacement are still needed, for small assem¬ 
blages, as are additional estimators for the number 
of shared species in multiple samples (A. Chao, 
personal communication). For many datasets, all 
existing methods for estimating undetected species 
seem to substantially under-estimate the number 
of species present, but the best methods nonethe¬ 
less reduce the inherent undersampling bias in 
observed species counts. Non-parametric estima¬ 
tors (e.g. Chaol, Chao2) perform best in empirical 
comparisons and benchmark surveys, and have a 
more rigorous framework of sampling theory than 
parametric estimators or curve extrapolations. 

4.4 Key points 

1. Biodiversity sampling is a labour-intensive activ¬ 
ity, and sampling is often not sufficient to detect 
all or even most of the species present in an 
assemblage. 

2. Species richness counts are highly sensitive to 
the number of individuals sampled, and to 
the number, size, and spatial arrangement of 
samples. 

3. Sensitivity to sampling effort cannot be 
accounted for by scaling species richness as a 
ratio of species counts to individuals, samples, 
or any other measure of effort. 

4. Sample-based and individual-based rarefaction 
methods allow for the meaningful comparison of 
diversity samples based on equivalent numbers 
of individuals and samples. 
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5. Non-parametric estimators of species richness, 
which use information on the rare species in 
an assemblage to adjust for the number species 
present but not detected, are the most promising 
avenue for estimating the minimum number of 
species in the assemblage. 
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CHAPTER 5 


Measurement of species diversity 

Brian A. Maurer and Brian J. McGill 


5.1 Introduction 

One of the most conspicuous aspects of biodiver¬ 
sity is the fact that individual organisms are orga¬ 
nized into relatively discrete units referred to as 
species. Although there is much variability in what 
can be called a species, generally a species repre¬ 
sents a distinct genetic lineage of organisms that 
interact with the environment in similar ways and 
are generally reproductively compatible. For both 
theoretical and practical reasons, it is often desir¬ 
able to know how many species are found in a 
given region of space-time (Chapter 4) and to know 
something about how abundant each species is rel¬ 
ative to others in the same community (Chapter 9). 
This relatively simple objective, however, becomes 
greatly complicated because there are so many dif¬ 
ferent ecological circumstances in which species 
diversity is measured. Because of this there have 
been a large number of quantities suggested as 
appropriate measures of species diversity (Box 5.1) 
(Pielou 1975; Krebs 1989; Magurran 2004). This 
plethora of indices makes it difficult to evaluate 
which method is appropriate in what particular cir¬ 
cumstances. The most commonly used indices are 
used primarily because they have been used before, 
and not necessarily because they provide useful 
information. 

In this chapter we consider the problem of 
species diversity by focusing firstly on precise def¬ 
initions of the term. We then describe the neces¬ 
sary statistical sampling theory that follows from 
the definition. There are two different types of 
sampling issues, both of which are important to 
developing an understanding of species diversity. 
First, there is the issue of how ecological circum¬ 
stances act as a probabilistic 'filter' in determining 
which specific species can be found in a region. 


We \vill refer to this as the ecological sample in 
what follows. Second, when collecting data on 
species abundances within a specified ecosystem, 
it is often not appropriate to assume that every 
last individual of every species has been identified. 
Hence, much field data are subsamples of a larger, 
unknown community. We will call this the empiri¬ 
cal sample below. Empirical samples are often used 
to estimate quantities assumed to represent the 
unmeasured parameters that describe the process 
thought to have given rise to the ecological sample 
(Green & Plotkin 2007). 

No ecosystem remains unchanged across space 
and time. However, there may be ecological con¬ 
ditions that are sufficiently similar that they may 
give rise to similar levels of diversity. How is it 
possible to determine whether multiple communi¬ 
ties arrayed across space and/or time give rise to 
an increase in diversity? In other words, given a 
set of communities that can be sensibly aggregated 
into a larger entity, is it possible to partition the 
overall diversity of the aggregate into a component 
due to within-community diversity and a compo¬ 
nent attributable to between-community diversity? 
The former has been termed 'a diversity' and the 
latter '/3 diversity' (Whittaker 1975; Chapter 6). Esti¬ 
mating these diversity components in communities 
that result from some combination of deterministic 
and random processes provides a unique statistical 
challenge. 

5.2 State of the art 

First it is necessary to define the underlying struc¬ 
ture of the statistical population from which the 
ecological sample of a community is derived. To 
do this, we start with a definition of a community 
for which a measure of species diversity is desired. 
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Box 5.1 Measures of species diversity and eveness 


There are a number of descriptors that are based on some 
concept such as evenness, diversity, or dominance that are 
not derived from any probability distribution. Of course 
many of the parameters from probability distributions could 
fit these goals as well (Box 9.2). The notation used 
throughout this section is: A/, is the abundance of the /th 
species after sorting (so A/, is the abundance of the most 
abundant species), 5 is the number of species observed, 
and N is the total abundance (A/ = 27 ;=1 A/;) and p, is the 
proportion of abundance for species ;(p, = A/,/A/). 5, is the 
number of species with an abundance /, so Si is the number 
of singletons. To provide some order, we have tried to 
group indices with similar goals and to use a consistent 
notation. 

Number of individuals (N) - Total number of 
individuals. This is another easily calculated yet powerful 
descriptor. Note that in neutral theory N is often denoted 
by J, but we use the more traditional N here. 

I. Richness metrics (S) 

Richness (S) - Species richness: the total number of 
species identified in the sample. It is among the simplest 
descriptors of community structure, 

Margalef diversity (S Marga i e f) - Margalef (Clifford & 
Stephenson 1975) noted that species richness increases 
with N, and in particular increases non-linearly and roughly 
logarithmically with N. S Ma rgaief = (S - 1)/ In N 

Menhinick diversity (5 Me nhinick) - In a similar vein 
(Clifford & Stephenson 1975), Menhinick proposed 
adjusting species richness by the similarly shaped square 
root of A/. -SMenhinick = S/^/TY 

Chao estimated diversity (S C h ao ) - Another way to 
make species richness 5 comparable between sites with 
different sample sizes N is to extrapolate to the richness of 
an infinite sample. Chao (1987) proposed a simple robust 
estimator for this: 5 C hao = 5 + 5, 2 /(25 2 ). 

Chao estimated variance - Although not an estimate 
of species richness, Chao provided an analytical formula for 
the variance in S C h ao that can be used to place error bars on 
Schao(5chao ± 1 -QeVSchaovar) and is given by 
^ChaoVar = ^2 [(Si /S 2 ) 4 /4 + (Si /S 2 ) 3 + (Si /S 2 ) 2 /2]. 

II. Diversity metrics (D) 

Diversity is traditionally taken to be a function of both 
richness and evenness, with less even communities being 


less diverse than their richness alone would indicate (should 
a species with only one individual count towards diversity 
the same as an abundant species?). 

Shannon diversity (//' or Shannon) - Shannon's 
information theory can be used to calculate the information 
in a community as an estimate of diversity. There is a finite 
population size version known as Brillouin's index 
(equation 5.9, main text) which should probably be used 
but usually isn't. ©shannon = -%Pi Inp, 

Simpson diversity (1 ID orO S impson) - Simpson 
noted that D = Zp, 2 gave the probability that two 
individuals drawn at random from an infinite community 
would belong to the same species. This has precedent in 
population genetics of the probability of getting two alleles 
the same. As such D is the inverse of diversity and some 
form of inverse is needed to create a diversity index. 
Although variations including 1 - D (related to the variance 
within species and to Hurlbert's PIE below) and —ln(D) 
(related to the Hill measure H 2 ) have been used, the most 
common way (e.g. MacArthur 1972) of converting 
homogeneity into diversity is Dsi mpS on = 1 /©• 

Hurlbert diversity (1 - PIE or £> Hur ibert) - Hurlbert 
(1971) argued that a biologically meaningful measure of 
diversity is the odds that a given interaction between two 
species is interspecific (PIE = 1 - D = probability of 
interspecific encounter). With a correction for finite sample 
size we have D Hu ribm = 1 - i7(n,/A/)[(/), - 1)/(A/ - 1)j. 

Diversity numbers (©Hina) - Hill (1973) proposed 
using information-based criteria to obtain 'weighted' 
counts of species, based on the degree of dominance. Also 
see Chapter 6. The weighted counts, H a , are obtained by 
choosing an appropriate value of a, with small a 
weighting rare species most and large a weighting 
common species most. Note that the Hill numbers are the 
exponential of the Renyi entropies given in equation 5.6: 

Hoc = exp(/?„). This set of measures contains many 
common measures as special cases. H- x = 1 /p s 
(reciprocal of the proportional abundance of the rarest 
species), H 0 = 5, W, = exp(W'), which some people have 
argued should be used instead of H' (where H' is the 
Shannon diversity orD Shan „ on above), H 2 = 1/£>(i.e. Ds irTl p Son 
above), and H x = 1 /C Re i) (reciprocal of the Berger-Parker 
index). Kempton (1979) found that a between 0 and 0.5 
provided the best discrimination in empirical data known to 
come from different communities. 

H a = | Zp, J 
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III. Evenness metrics (E) 

Evenness is a measure of how different the abundances of 
the species in a community are from each other (Smith & 
Wilson 1996). A community where every species had the 
same abundance would be perfectly even. All natural 
communities are highly uneven, so evenness is a relative 
statement. Most evenness indices are scaled to 
approximately run from 0 = maximally uneven to 
1 = perfectly even. 

Shannon evenness (J' or Eshannon) - If diversity is a 
mixture of richness and evenness, then removing richness 
should produce evenness. This is the logic behind 
Shannon's evenness measure; the highest value of Dshannon 
when all species are equally abundant can readily be seen 
to be ln(5) so dividing by In(5) will give an index from 0 to 

1 ■ ^Shannon = ^shannon/ Iu(5) 

Simpson evenness (1 IDIS or E S j mp son) - The same 
logic applies to Simpson's diversity, giving 

^Simpson ~ Dsirrpson/S. 

Camargo evenness (or E Ca margo) - The highest 
possible evenness is when p ; = p- = 1/S, so Camargo et al. 
(1993) suggested a direct measurement of deviation from 
this ideal. F Ca margo = 1 - T, |p,. - py|/5, where the sum is 
taken over i = 1 ...S,j = i+ 1 ... 5 . 

Smith Wilson evenness (E var or E Sm ithwiison) - 
Smith and Wilson (Smith & Wilson 1996) reviewed an array 
of evenness indices and assessed them on some core 
properties. Two core properties they identified are spanning 
the whole range 0-1 and being independent of unit of 
measure (evenness of biomass measured in grams should 
be equal to evenness of biomass measured in kilograms). 
They invented an index that performed well on these goals, 
which they called f V3I . The formula is based on the variance 
of log abundances (centered on the mean of log 
abundances) then appropriately scaled to cover 0-1. 

Fsmithwitson = 1 - arctan ^ E(ln(n,) - /x ln ) 2 ] where 
Pi n = j£ln(n,). 

RAD beta or NHC evenness (E NH c) - The slope of 
the rank abundance diagram (RAD) has long been 
interpreted as a measure of evenness (a perfectly horizontal 
line would represent perfect evenness). Nee, Harvey, and 
Cotgreave (Nee et al. 1992) proposed taking the slope of 
the regression line through the points in the RAD as a 
measure of evenness. This runs from (—oo,0). Some 
authors (e.g. Smith & Wilson 1996) rescale this to go from 
0 to 1 -i.e. -2/arctan (|3), but we prefer to keep the 
simple geometric interpretation |3, where |3 is the OLS 
slope of log abundance vs. rescaled rank (divide by 5 so 
rank goes from 1 /5 to 1). F NH c = j8 as above. 


Diversity number ratios (£ H iii) - Using diversity 
numbers (Hill 1973), one can take ratios of different 
diversity numbers F a ,p = H^/Hp to express the degree of 
evenness among species within an ecological sample. The 
most natural case is (3 = 0, then H 0 = 5, which is the 
maximal value (i.e. on a perfectly even community with 
Pi = N/S) of Hoc for all a. For example, log(W 2 )/ log(W 0 ) 
gives Shannon evenness. 

IV. Dominance or common species metrics (C) 

Dominance is a measure of how much one or a few species 
dominate the community numerically (McNaughton & Wolf 
1970). In some ways it is the inverse of evenness, but it is 
specifically focused on the right side of the SAD (very 
common species). 

Absolute dominance (C A b*) - The simplest measure 
of dominance is simply A/-|, the abundance of the most 
abundant species. Although it might seem that A/ t would 
be so heavily dependent on total abundance N as to be 
useless, in some systems A/, can stay surprisingly constant 
even while N varies. C A b S = A/i 

Relative dominance (Berger-Parker) (C Re i) - The 
easiest way to correct for the effects of N is to divide by N, 
producing p, the relative abundance of the most abundant 
species (Berger & Parker 1970). Cn e i =Pi 

McNaughton dominance (C M cNaught) - McNaughton 
(1970) made a more robust measure that was less subject 
to the vagaries of a single species by looking at the 
proportional abundance of the two most abundant species 
(and rescaling to 0-100). A similar index based on the 
abundance of the three most abundant species was made 
by Misra and Misra (1981). 

GlcNaught = (Pi + P 2 ) x 100 = [(A/) + N 2 )/2N] x 100 

V. High rarity metrics (R) 

Rarity-the opposite of dominance metrics-focus 
on an assessment of rare species. Since abundance is 
bounded at 1, rarity metrics focus on the number of 
species with specific abundances in contrast to 
commonness metrics which focus on the abundance of 
specific species. 

LogSkew (/?u>gskew) - Skew is the third moment of a 
probability distribution, measuring asymmetry. Right skew 
(positive numbers) indicates more probability on the right 
(abundant) side. Left skew (negative numbers) indicates 
more probability on the left side. All species abundance 
distributions are strongly right skewed on an arithmetic 

Continued 
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Box 5.1 ( Continued) 

scale, so the more interesting measure is skew on the log 
scale. This measures asymmetry relative to the log-normal. 
A negative number indicates an excess of rare species 
(McGill 2003). /? L „ gSte „ = [Ifoogfa) - ^f/s\ / 

pdogfo) - m) 2 a] ' 5/(5 - 2)V[(5 - 1)/5], where 
ju is the mean of log(/ 7 ,). 

% Singletons (/?si ng ieton) - A simple measure of rare 
species is to count the number of singletons, /?singieton = Si 

PctRare1% (R 1% ) - Like the first two dominance 
measures, /?singieton focuses exclusively on one abundance 
class, potentially making it a noisy metric. By focusing on 
multiple abundance classes, this problem can be avoided. 
The challenge is to define which species count as rare. A 
simple one is to call any species with an abundance less 
than 1 % of total abundance rare. A major shortcoming of 
this is that no species can have an abundance less than 1 % 
of N if N < 100. This measure is only useful when N is at 
least several hundred. Ri% = (5, +S 2 + ... +5 r )/5, 
where r is the largest integer less than 0.01 x N. 

PctRare5% (/f 5 %) - An alternative, more expansive, 
definition that can work with N > 20 is that a species is 
rare if its abundance is < 5% of N. 

R 5 % = (5, + S 2 + ... + S T )/S, where T is the largest 
integer less than 0.05 x N 

PctRareN/S (Raw) - Both the 5% and 1 % cut-offs are 
relative to N only, not 5. A thousand individuals with 500 
species is bound to have many rare species by these 
definitions but 1000 individuals with 10 species will be very 
different. A simple measure that attempts to correct for this 
is to count a species as rare if its abundance is < N/S. NIS 
gives the average abundance of a species. Because of the 
strong right skew, the average abundance will be greater 
than the median (50th percentile) abundance and thus 
quite large. R N/S = (5, + S 2 +... + 5 T )/5, where T is the 
largest integer less than NIS 

VI. Semi-parametric metrics 

Several metrics are not parameters from the probability 
distributions listed in Box 9.1, but are related to those or 
other probability distributions. 

Fisher's a (a or S F j S her) - The log-series distribution 
parameter is called c, but as discussed in Box 9.2 for the 
log-series, <x can be calculated as a function of N and c by 
oc = A/(1 - c)/c. Although typically smaller (sometimes by 
a factor of 2-10) than S, it is strongly correlated with 5. It 
has been recommended as a sample-size-independent 
estimator of richness (Rosenzwei^1995). 


Lognormal CV - As discussed in the section on the 
lognormal distribution (Box 9.2) (also see Limpert et al. 
2001), the coefficient of variation CV = ^/o is not a 
parameter but is perhaps the best single descriptor of shape 
for the log-normal (making it analogous to the gamma and 
Weibull shape parameters). A high CV indicates many rare 
species (high unevenness). 

Prop LN fx*, Prop LN a*, Prop LN CV - In the 
log-normal distribution, both /x* and cr* are in units of 
abundance and scale with increasing sample size 
N (i.e. fx = log (fx*) and cr = log(cr*) scale as log(/V)). 
One way to adjust for this is to calculate the log-normal 
parameters (mean, standard deviation, CV) on the log of 
the relative abundances, log(p,). This removes the heavy 
dependence on N. 

Gambin a-The Poisson-gamma distribution (which 
leads to the logseries distribution) is discussed in Box 9.2. 
An alternative sampling distribution to the Poisson is the 
binomial (Green & Plotkin 2007), where the gamma 
distribution gives the probability p of the species appearing, 
which is then passed through binomial sampling. This gives 
the binomial gamma, which might be a good model for 
SADs (Ugland et al. 2007). Since the gamma distribution 
runs to oo, it is necessary to truncate the right tail (say at 
the 99th percentile). By scaling without loss of generality so 
that the maximum value is 1, the binomial gamma or 
GamBin (Ugland et al. 2007) is well defined with only one 
parameter, the gamma shape parameter. It has been 
shown that the GamBin fits many datasets well and that 
the parameter a may be a good proxy for the habitat 
complexity from which the community is sampled (Ugland 
et al. 2007). 

/Blog* and /‘logit - As discussed in Chapter 9, 
fitting a sigmoidal logistic function to the empirical 
cumulative distribution function (ECDF) on a log 
proportional abundance scale is tantamount to 
hypothesizing a log-logit probability distribution (Evans 
et al. 1993; Williamson & Gaston 2005). The logistic 
function (and log-logit probability distribution) has two 
parameters: / is a scale parameter and gives the location of 
the inflection point on the x-axis. In this context, since the 
inflection occurs at 50% of species accumulated, it gives 
the median relative abundance on a log scale. Since there 
are many more rare species than common, this is 
tantamount to a form of measurement of how many rare 
species there are. Similarly, m represents the slope of the 
function at the intercept, and as such is a proxy for 
evenness. When m = 0 every species has different 
abundance and when m = oo the logit function 
becomes a step function and all species are 
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equally abundant. ECDF(p) = 1 /{1 + exp [-m x (p - /)]}, 
which can be fitted to the ECDF by least squares. 

THgenlogr Tgenlog , 3tld dgenlog “The loqit function 
assumes that the ECDF is symmetric about the inflection 
point, which it may not be. A generalized logit with three 
parameters can allow asymmetry. By the appropriate choice 

Assume that a community can be described by a 
set of / individual 'sites' occupied by a single indi¬ 
vidual from one of S species. For some species, an 
individual may not be a discrete unit, but may exist 
as a distributed network of 'nodes', such as a tree 
species that generates above-ground stems from a 
network of roots. In such situations, it is impor¬ 
tant to define precisely what part of the network is 
being counted (e.g. stems). To model this situation, 
let the random variable X, ; be defined as follows: 
Xjj = 1 if site j is occupied by species i and X, , = 0 
otherwise. Since each site can only be occupied by 
a single individual, it is convenient to combine the 
random variables Xjj . across each species at site j 
as a single random vector X r The limiting distri¬ 
bution of Xj can be generally considered to be a 
multinomial distribution with a single observation. 
Given this general structure, we can calculate the 
expected value of the random vector X , over species 
as a vector of probabilities q ; = [g, ; ], where q,j is the 
probability that species i is found on site j. Collect¬ 
ing the probability vectors from all sites into a single 
matrix gives the S x / matrix Q, which summarizes 
the probabilities of species distributed across all 
sites. The expected abundance (N,) of each species is 
obtained as 

Ni = (5.1) 

The vector N = [N) N 2 .. .N s ] is sometimes called 
the species abundance distribution (see Chapter 9) 
and has been a major focus of research in ecol¬ 
ogy for the past several decades. Because of the 
underlying probabilistic definition of N, it is con¬ 
venient to consider it as a random vector. An active 
area of current investigation is the examination of 
alternative stochastic processes that can model the 
evolution of N in space-time (Alonso & McKane 
2004; Etienne 2005). Consideration of these models 
is beyond the scope of this chapter; suffice it to 


of a three-parameter sigmoidal logit-like function, the 
parameters m and / can retain their meanings and a single 
parameter a can describe the degree of asymmetry. 

ECDF(p) = 1/{1 tax exp[-m x (p - /)] 1/a }, which can 
be fitted to the ECDF by least squares. 


say that the distribution of N that is most useful 
is the equilibrium distribution of the underlying 
stochastic process generating the ecological sam¬ 
ple. Here we will use the multinomial distribution 
to represent this limiting distribution. Finally, it is 
also useful to consider the relative species abun¬ 
dance distribution to be represented by the random 
vector 

p = N/N (5.2) 

where N = £, Nj. 

5.2.1 Species diversity as variance 

We are now in a position to consider definitions 
of species diversity. Diversity is often intended to 
represent two different aspects of the species abun¬ 
dance distribution. The first is termed 'species rich¬ 
ness' or, simply, the number of species in the eco¬ 
logical sample. For reasons that will become evi¬ 
dent shortly, the number of species in the ecolog¬ 
ical sample may not be equal to S (the number 
of species that have nonzero probabilities of being 
found in at least one site in the community). In other 
words, it is possible that n, might equal zero for 
some species in a particular ecological sample, even 
though the species could possibly be found in the 
community. Let S e be the number of species with 
non-zero abundances in a single ecological sample, 
then obviously S e < S is the species richness of the 
sample. The other component of species diversity 
is the degree to which the relative abundances are 
similar among species. This has been called 'even¬ 
ness' in the ecological literature, but in actuality the 
underlying concept of interest is the covariance in 
relative abundances among species. 

There are two sources of variation in relative 
abundances among species. The first is the vari¬ 
ation within a species and the second is the 
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variability among species. The variance within a 
species is 

var(p,) = p,(l — pi) (5.3) 

where p, is the z th element of the vector of rel¬ 
ative abundances (p). Since relative abundances 
of species are necessarily correlated, a measure of 
between species variability is the covariance of the 
relative frequencies of species i and k, that is 

cov(p, , p k ) = -piPk (5.4) 

The total variance across all species is obtained by 
summing equations (5.3) and (5.4) across all species 
and pairs of species, which gives 

V = 1 - St pf - 2£j <k pi p k (5.5) 

In the two extreme cases when all species are 
equally abundant or when all species have zero 
abundance except for one, V reaches its minimum 
value of zero (no variability among species). Thus, 
this variance itself is probably not useful as a mea¬ 
sure of species diversity because of this. However, 
equation (5.5) is useful heuristically because it illus¬ 
trates how the two separate aspects of species diver¬ 
sity, richness and evenness, might be related. 

The two terms in equation (5.5) represent dif¬ 
ferent aspects of species diversity. The quantity 
D = E, pf is the familiar metric of species diver¬ 
sity first suggested by Simpson (1949). It has often 
been called a measure of 'dominance'. In equa¬ 
tion (5.5), 1 — D(= 1 — Ep 2 ) represents the total vari¬ 
ance attributable to within-species variability. D 
is well known to be correlated with species rich¬ 
ness, and in the context used here it could be con¬ 
sidered to be a probabilistic measure of richness. 
As a measure of species diversity, however, it is 
incomplete because it does not include information 
about the variability in relative abundances among 
species given by the last term in equation 5.5. The 
last term in equation 5.5 is particularly interest¬ 
ing because it is a measure of the degree to which 
abundances covary among species. Intuitively, the 
summed covariances in relative abundances among 
species capture the essence of the 'evenness' com¬ 
ponent of species diversity. 

To summarize, if species diversity is defined as 
the number and relative abundances of species 
within a community, one way to represent it is to 


partition the total variation in species abundances 
among species into a within-species component and 
a between-species component. The within-species 
component represents the richness aspect of species 
diversity and the between-species component rep¬ 
resents the evenness aspect. 

5.2.2 Species diversity as information 

In the previous section, species diversity was 
defined by partitioning the total variance of abun¬ 
dances across all species in the ecological sample. 
An alternative approach developed in the early 
1960s was based on measuring the information 
content of a long string of symbols developed by 
information theorists (Pielou 1975 and references 
therein). The basic idea of the analogy is to view 
an ecological sample of species as a 'message' with 
individual organisms as pieces of 'information'. The 
relevant information is the taxon to which each 
organism belongs, and the measurement of this 
'taxonomic information' is obtained from the rel¬ 
ative abundances of species. A general measure 
of information content per symbol in an infinitely 
large set of symbols for which some 'code' exists is 
given by 

= In [£, pf] /(I — a) (5.6) 

where a is an arbitrary integer (Hill 1973; Pielou 
1975). These are called the Renyi entropies of order 
a. Different values for a produce different weight¬ 
ings of the information content inherent in the rel¬ 
ative abundances of species with low values of a 
(< 0) weighting in favour of rare species (in the 
limit R_oo is a function only of the p, of the rarest 
species) and high values of a emphasizing weight¬ 
ing in favour of common species (again R <*, being 
a function only of the p, for the most common 
species). Three values of a are of particular interest 
to ecologists. The first, Ro, is simply the logarithm 
of the number of species in the ecological sample. 
When a = 2, equation (5.6) yields I ?2 = — log D, that 
is, the negative logarithm of Simpson's diversity 
measure. The final value of interest is the limit of 
equation (5.6) when a approaches 1, which yields 

H' = Ri = -E,p, log p, (5.7) 
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This is the well-known Shannon measure of species 
diversity (Pielou 1975), which is widely used. Hill 
numbers, which are simply the exponent of Renyi 
entropy (H a = exp(R a )), are also commonly used 
in ecology (Box 5.1 and Hill, 1973) and Chapter 6, 
where Hill number H a is denoted q D. 

The appeal of equation (5.6) as a measure of 
diversity is that it is convenient to define even¬ 
ness quantitatively. Intuitively, evenness should be 
greatest if all species are equally common. If this 
is the case, equation (5.6) yields a value of log 
S e regardless of what particular value a takes on. 
Hence, it is sometimes useful to rescale R a by divid¬ 
ing it by its theoretical maximum, yielding 

R:=R a /logS e (5.8) 

Rj (also denoted /') has been widely used as mea¬ 
sure of evenness in the ecological literature. 

Pielou (1975) points out that if there is a finite 
number of individuals in an ecological sample, then 
the information content per species for that particu¬ 
lar sample is 

H B =(l/N)log[(N\)/n i N i \] (5.9) 

This form of information diversity is related to 
H' = R\ because as the values of the abundances 
of species become very large, Hg converges on 
the Shannon diversity, H' = R\ (Pielou 1975). How¬ 
ever, for small collections of individuals (i.e. small, 
fully censused communities), equation (5.9) is the 
appropriate measure of information. It has become 
known as the Brillouin index and takes on values 
slightly less than Shannon values (Hg < H = Rj). 

5.2.3 Traditional measures of various types 
of diversity 

Given that species diversity has at least two differ¬ 
ent general formulations, that is, it can represent 
a partitioning of abundance in a community into 
between- and within-species variance components 
or it can represent shared or mutual information 
among species, a large number of measurements 
have been suggested to represent these different 
aspects of species diversity. Here we survey these 
measurements and indicate how they relate to our 
distinction between species diversity as variance 
and species diversity as information. 


We have classified species diversity measure¬ 
ments into six categories based on how researchers 
have proposed to use them (Box 5.1). The first set 
of metrics contains those that attempt to express 
some basic aspect of 'richness'. The idea behind 
these metrics is to express some aspect of the 
number of species in the ecological sample. Some, 
such as Chao's estimators (Chao 1987), attempt to 
use information from an empirical sample to infer 
the species richness of the underlying ecological 
sample. 

A second set of metrics used widely in the liter¬ 
ature we refer to as 'diversity' metrics and include 
the most widely used metrics. In this context, diver¬ 
sity is used to mean a combination of both rich¬ 
ness and evenness. Simpson's diversity metric is 
the within-species component of variance discussed 
above, but also is related to the information concept 
of diversity. Shannon's diversity metric ( H' or R\) 
is probably the most commonly used expression of 
species diversity. 

Evenness metrics all attempt to examine how 
abundance is apportioned among species within 
a community. The basic concept underlying all of 
these measurements is that evenness is highest 
when a community is not dominated by a few 
species of very high abundance or equivalently that 
all species have an equal abundance. Low evenness 
implies that most species in the community are very 
rare, and consequently may contribute very little to 
the underlying ecological role the community plays 
within the ecosystem that contains it. 

Dominance metrics are in many ways the con¬ 
verse of evenness. If the scientific objectives of a 
study focus on the most common species in a com¬ 
munity, then dominance measures may be the most 
appropriate descriptors of species diversity. Like¬ 
wise, in some studies it may be more important to 
focus on the rarest species. This might be particu¬ 
larly true in conservation studies where rare species 
may be of particular interest in determining the 
value of locations for the conservation of biological 
diversity. For such studies there are a variety of met¬ 
rics that focus on the number of rare species found 
in a community. 

Finally, a variety of metrics are based on var¬ 
ious parametric or non-parametric descriptions 
of the probability distribution underlying the 
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apportionment of diversity among species. Some 
of these metrics are related to probability distri¬ 
butions that arise from assuming a certain type 
of mechanism underlying the dynamics of abun¬ 
dances among species over time (see Box 9.2 and 
Chapter 9). Generally, these metrics should be used 
to fit parameters in specific models that might 
underlie the structure of a community. 

5.2.4 Addressing the difference between the 
empirical and ecological samples: estimating 
species diversity components using empirical 
samples 

A fundamental difficulty that has not been 
addressed up to this point is the nature of the 
'object' being measured when ecologists collect 
information on the abundances of species at a spe¬ 
cific location. This 'object' is typically called a 'com¬ 
munity', and is defined as all the organisms belong¬ 
ing to a set of species found at a given point in space 
and time. The existence of such an object in any real 
sense might be questioned on many grounds (Mau¬ 
rer 1999; Ricklefs 2008), yet there is enough accu¬ 
mulated evidence to suggest that counting organ¬ 
isms of different kinds of species in local regions 
of space-time is of enormous practical value (Chap¬ 
ters 17,18, and 20). Here we focus on how to analyse 
and interpret the data obtained from such counts 
given that not all individuals can be counted and 
some species that have appreciable populations in 
the region may in fact not show up in these empir¬ 
ically derived counts. In practice, nearly all data 
obtained by ecologists form an empirical sample 
rather than an ecological sample. It is therefore tech¬ 
nically incorrect to calculate a diversity measure 
(previous section and Box 5.1) on an empirical sam¬ 
ple and claim it is the correct value for the ecological 
sample. 

Estimating the number of species, S e , in an eco¬ 
logical sample from empirical samples assumes that 
in a local community conditions remain constant 
enough over the sample period to assume that there 
are no changes in the relative abundances and inci¬ 
dence of species. If this is the case, then one can 
assume a 'collector's curve' exists, so that as the 
total number of individuals sampled increases (the 


size of the empirical sample approaches the size of 
the ecological sample), the total number of species 
identified begins to asymptote, reaching a theoret¬ 
ical maximum of S e . S e is commonly denoted just 
S, but it is expected in practice to recognize the 
distinction between the empirical sample and the 
ecological sample, and use one of the techniques 
below to estimate this value despite the notational 
imprecision. 

There are two basic approaches to estimating S e . 
First, it is possible to assume that some parameter¬ 
ized distribution function can be used as a model 
for a given species abundance distribution. If this 
is true, then at least for some statistical distribu¬ 
tions, S e is a parameter (or function of parame¬ 
ters) of the distribution that can be estimated using 
a sufficiently large empirical sample (Pielou 1975; 
Magurran 2004). The second general approach is 
to observe and extrapolate the empirical pattern 
of accumulation of species as the number of indi¬ 
viduals in the empirical sample accumulates. The 
problem with this approach is that there is no log¬ 
ical way of choosing the sequences with which 
individuals in the empirical sample are accumu¬ 
lated. If there are n individuals in an empirical 
sample, there are n\ possible ways of accumulating 
individuals. Estimates of accumulation curves can 
be constructed either by random sampling (with¬ 
out replacement) samples of various sizes from the 
empirical sample or by examining the average rate 
of accumulation using rarefaction (Simberloff 1972). 
More details on estimating species richness are dis¬ 
cussed in Chapters 4 and 20. 

Estimating the diversity and richness of ecologi¬ 
cal samples is limited by the amount of information 
available on the ecological sample being studied. 
Generally, both the total number of organisms, N, 
and the number of species, S e , in the ecological 
sample are unknown. If both of these quantities 
are large, then any single empirical sample may be 
inadequate to fully characterize the entire ecologi¬ 
cal sample (Pielou 1975; Peet 1974; Magurran 2004). 
Empirical samples that are much smaller than the 
ecological sample are unlikely to contain all species 
found in the ecological sample. In particular, rare 
species may show up in relatively few empirical 
samples. 
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The solution to the dilemma posed in the pre¬ 
vious paragraph is to examine the behaviour 
of species diversity measurements among many 
empirical samples taken of the same ecological sam¬ 
ple (also see Chapter 9). By taking multiple empir¬ 
ical samples from a single empirical sample, we 
gain information about the variance involved in 
the empirical sampling process. If a certain num¬ 
ber of sampling units are drawn from the larger 
ecological sample, then it is possible to calculate 
measures of species diversity as a function of the 
number of sample units aggregated (Pielou 1975; 
Magurran 2004). This assumes, of course, that each 
sampling unit is drawn from the same ecological 
sample. On the other hand, if conditions change 
across space and time, then aggregating sampling 
units may not be appropriate. This assumption may 
be problematic if most ecological communities are 
open systems (Maurer 1999). In the next section we 
consider how to evaluate whether several empirical 
samples are drawn from the same empirical sample. 
It is still common in practice to ignore the distinc¬ 
tion between the empirical sample and the ecologi¬ 
cal sample, and simply calculate diversity statistics 
(other than richness discussed above) simply on the 
empirical sample and report them as if they were 
the true value for the ecological sample. Despite 
being common practice, this is incorrect (especially 
for small samples) and the approach just outlined is 
superior. 


5.2.5 Testing for heterogeneity among 
ecological samples 

In the simplest case, suppose two empirical sam¬ 
ples have been obtained from a specified location. 
The question to be answered is whether these two 
samples can be considered to be samples of a sin¬ 
gle larger community or whether they are different. 
The species abundance distribution for each loca¬ 
tion can written as a vector where / = 1,2 indexes 
locations. The abundance for species i at location 
j is given by n tj . The relative abundances are 
then 


(5.10) 


The index i goes from 1 to S c , which is the number 
of species found in at least one of the samples. Note 
that some of the relative abundances may be zero. 
Writing the relative abundances as a vector gives 
two relative abundance vectors pi and p 2 . Finally, 
we can calculate the relative abundance vector for 
the combined two samples as 


p,= 


(&i)pi + (jl n ‘ 2 ) Pz / 


(5.11) 


Here we assume that each of the abundance dis¬ 
tributions can be approximated by a multinomial 
distribution. If this is the case, then we have two 
candidate models to describe the data. The first 
model assumes the two samples come from the 
same community, hence there is a single multino¬ 
mial distribution that describes both samples and 
there are S c parameters (the actual relative abun¬ 
dances in the single ecological sample). Note that 
we here assume that S c = S,,, that is, all the species 
in the ecological sample are found in at least one 
of the two empirical samples. The second model 
assumes that each empirical sample comes from 
a different ecological sample, which means that 
we would have two different multinomial distri¬ 
butions, each with S c parameters (the relative fre¬ 
quencies of each species in each of the two different 
communities). The second model requires twice as 
many parameters to describe the data as the first 
model. 

To compare such models, we suggest using the 
information theoretic approach described by Burn¬ 
ham (2002). The approach is based on estimating 
the log likelihood of each model given the data and 
substituting the empirical estimates of the parame¬ 
ters into the likelihood function. Interestingly, for 
the multinomial distribution, the negative log like¬ 
lihood function for a given data set is simply the 
Shannon information measure times the number of 
individuals in the sample. Letting H lc be the Shan¬ 
non diversity for the combined data, the negative 
log likelihood for the model assuming only a single 
ecological sample is 

L c = (E,' 1 ' 1 + E,"' 2 ) Hlc 


Pij =n lt /J2 n ij 


(5.12) 
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For the second model, which assumes two different 
ecological samples, the negative log likelihood (L 2 ) 
is obtained as 

h = (E, «-t) + (E, n ' 2 ) ( 5 - 13 ) 

where H n and H 12 are the Shannon diversities for 
each of the separate empirical samples. The log like¬ 
lihoods for the two models are then compared by 
calculating the respective Akaike Information Cri¬ 
terion (AIC) for each model: 

AIC c = 2 (L c + S c ) (5.14) 

AIC 2 = 2(L 2 +2S c ) (5.15) 

The best model is the one which has the lowest AIC. 
Generally, a difference between AICs of 2.0 or more 
indicates that the model with the lowest AIC has 
'significantly' more support from the data (Burn¬ 
ham & Anderson 1998). Furthermore, it is possible 
to calculate model weights using AIC differences. 
The interested reader is referred to Burnham (1998) 
for further details. 

This procedure can be generalized to evaluate 
whether several different empirical samples come 
from the same ecological sample. As the number 
of empirical samples being compared increases, the 
number of possible models increases rapidly, mak¬ 
ing it impractical to compute all possible compar¬ 
isons. In such cases it may be best to use some 
independent criterion (such as distance between 
samples) to group empirical samples into a small 
number of aggregate samples that can be examined 
using AICs. 

The model selection procedure provides a basis 
for asking questions about so-called '3 diversity'. In 
the example described above, the first model, which 
assumes that the two empirical samples are drawn 
from the same ecological sample, there is no 3 
diversity. The two samples are describing the same 
relative abundance distribution. The second model 
assumes that the two empirical samples are drawn 
from different ecological samples, which implies 
that there is |3 diversity (i.e. a turnover in abun¬ 
dances among communities). The degree to which 
the second model is supported by the data is related 
to how much 3 diversity exists between the two 
samples (Chapter 6). With several sites sampled, it 


is also possible to partition diversity into between- 
site (3) and within-site (a) diversity components 
(Chapter 6) (Whittaker 1975; Lande 1996; Crist et al. 
2003; Crist & Veech 2006). 

5.3 Prospectus 

While easy to conceptualize, diversity (and even¬ 
ness) are hard to measure. We present a basic 
framework here. Several developments are needed 
to provide a truly firm foundation to the measure¬ 
ment of diversity. First, more attention to and devel¬ 
opment of methods to account for the fact that col¬ 
lected data are sampled data are needed (i.e. the 
distinction between the empirical sample and the 
ecological sample). Second, rather than developing 
new measures of diversity by ad hoc processes we 
hope to see a further focus on fundamental ideas 
like variance and information. 

5.4 Key points 

1. To properly measure diversity requires recog¬ 
nition that the data usually collected represent 
just samples (empirical samples) from the actual 
community (ecological sample), which is in turn 
a probabilistic, imperfect representation of the 
potential community. 

2. There are a great many approaches proposed 
in the literature to measure aspects of diversity, 
including richness, evenness, and the combina¬ 
tion (diversity). Most of these approaches have 
been fairly ad hoc. There are probably at least 
two or three times as many measures proposed 
as the ones we cover in Box 5.1. We have tried to 
highlight the most commonly used and success¬ 
ful measures. 

3. We present a uniform framework for building 
diversity measures. Two of the oldest measures 
of diversity, Simpson and Shannon, and their 
corresponding evenness measures, turn out to 
be directly related to two fairly deep concepts of 
diversity: variance and information. 

4. The best way to use empirical samples to get 
at the ecological sample is to take multiple 
empirical samples of the same ecological sample. 
This provides information about the variability 
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induced by the empirical sampling process and 
allows for the development of an asymptotic 
approach that can then be extrapolated to the 
properties of the ecological sample. One com¬ 
mon method is to plot the measure of inter¬ 
est vs the number of empirical samples and 
extrapolate. 


5. We present a method for testing whether two 
(or any combination of more than two) empir¬ 
ical samples are drawn from a single ecologi¬ 
cal sample using the multinomial distribution 
and likelihood/AIC methods. It turns out that 
the likelihood is directly related to Shannon's 
diversity. 



CHAPTERS 


Compositional similarity and (3 (beta) 
diversity 

Lou Jost, Anne Chao, and Robin L. Chazdon 


6.1 Introduction 

Spatial variation in species composition is one of the 
most fundamental and conspicuous features of the 
natural world. Measures of compositional differen¬ 
tiation and similarity quantify this variation. Con¬ 
servation biologists apply these measures when set¬ 
ting conservation priorities or evaluating regional 
conservation plans, and ecologists use them to 
study the homogenizing or diversifying effects of 
human activities, natural disturbances, or spatial 
variability of environmental conditions (McKinney 
and Lockwood 1999; Olden 2006; Vellend et al. 
2007). Measures of similarity and differentiation are 
also essential tools for evaluating the effects of iso¬ 
lation by distance or geographic barriers, and for 
describing changes in species composition along 
environmental gradients. 

Ecologists aim not merely to describe spatial vari¬ 
ation in community composition, but ultimately 
to understand the causal factors that produce it. 
Is spatial heterogeneity caused or maintained by 
disturbances? Do species assemblages converge or 
diverge in composition over time during succes¬ 
sion (Terborgh et al. 1996; Vandermeer et al. 2004)? 
How do environmental gradients affect the distrib¬ 
utions of species (Bray and Curtis 1957; Whittaker 
1960,1972; Tuomisto et al. 1995)? To what extent do 
compositional differences reflect neutral variation 
in species abundance due solely to dispersal lim¬ 
itation (Hubbell 2001)? Carefully chosen measures 
of similarity and differentiation can be connected 
to appropriate null models to enable hypothesis 
testing. 

In ecology, a general approach to assessing 
compositional differentiation partitions regional 


diversity (y) into within- (a) and between- (|3) 
group components, and derives similarity and dif¬ 
ferentiation measures from these components. A 
wide range of similarity, differentiation, and (3 
diversity measures are in common use, reflecting 
the many ways that assemblages can be said to 
differ in composition (reviewed by Legendre and 
Legendre 1998; Vellend 2001; Koleff et al. 2003; 
Jurasinski et al. 2009; Tuomisto 2010). 

Here, we provide a conceptual overview of 
approaches to quantifying compositional similar¬ 
ity and differentiation. We then discuss the fun¬ 
damental connection between diversity and com¬ 
positional differentiation, and show how some 
of the most important and useful similarity and 
differentiation measures are directly connected 
to diversity measures through the concept of (3 
diversity. 

6.2 State of the field 

Before we can discuss the important approaches in 
this field, we need to standardize some terms and 
introduce some mathematical concepts. Our basic 
unit of analysis is the assemblage, the set of individu¬ 
als exposed to our sampling efforts in a defined area 
or point. Ecologists generally try to delimit assem¬ 
blages so that they are spatially uniform in expected 
species composition. Assemblages can be delimited 
by uniform habitats, uniform soil types, constant 
elevational ranges, or host species, and may range 
in size from a few cubic centimetres of soil to mil¬ 
lions of hectares of forest, depending on the biology 
of the taxa being examined, as well as the ques¬ 
tion being asked. If an assemblage really is spatially 
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uniform in this sense, ecologists can make many 
samples throughout the assemblage, and each sam¬ 
ple will be a replicate drawn from the same assem¬ 
blage. When this assumption holds, incidence or 
abundance data from the replicate samples can 
be used to estimate the relative abundances of 
each species in the assemblage (Chapter 4). If the 
assemblage is not homogeneous in the statistical 
sense, it can still be characterized by its average 
properties. 

Assemblages are said to have identical compo¬ 
sitions if the relative abundance of each species in 
one assemblage is the same as the relative abun¬ 
dance of that species in the other assemblages (or, 
when ignoring abundances, they are identical if the 
lists of species in the assemblages are the same). 
Thus assemblages can be compositionally identi¬ 
cal even if they differ greatly in density (number 
of individuals per unit area or volume) or area. 
This definition separates these logically distinct fac¬ 
tors. At the other extreme, assemblages are maxi¬ 
mally dissimilar if they share no species. A mea¬ 
sure of compositional similarity between assem¬ 
blages compares the relative abundance of each 
species in each of the assemblages and returns a 
summary measure of the closeness of these relative*" 
abundances. 

A measure of relative compositional similarity 
ranges from 0 to 1, assigning a value of 0 to a set 
of assemblages that share no species, and a value of 
1 to a set of compositionally identical assemblages. 
Compositional differentiation is, loosely speaking, 
the opposite of compositional similarity. A measure 
of relative compositional differentiation ranges from 
0 to 1, assigning a value of 0 to a set of composi¬ 
tionally identical assemblages, and a value of 1 to a 
set of assemblages that share no species. Measures 
of compositional similarity and differentiation are 
really just measures of the divergence between 
the species probability distributions of each assem¬ 
blage. Divergence measures play an important role 
in statistics and information theory, and their prop¬ 
erties are well studied (Pardo 2006). 

Ecologists use similarity measures to make infer¬ 
ences about their assemblages. These inferences 
usually conceal implicit assumptions about the 
mathematical properties of the similarity measure 
used. If the similarity measure does not have 


the required properties, the ecological inferences 
will be invalid. At least the following three basic 
properties should be present in any ecologically 
useful measure of similarity. 

The most basic property is that the measure must 
truly reflect similarity. Similarity is a multifaceted 
concept, so it is not possible to define it with com¬ 
plete precision. Different similarity measures may 
legitimately differ in the way they rank sets of 
assemblages. However, the concept of similarity 
does have an unambiguous core. It is possible to 
create a sequence of assemblages whose compo¬ 
sitional similarities unambiguously decrease. Sup¬ 
pose we have a set of equally large, perfectly even 
assemblages (all species equally common), and sup¬ 
pose each assemblage is identical in composition. If 
we add unique new species to each assemblage, at 
the same abundances as the pre-existing species, the 
similarity of these assemblages must decrease. This 
is an essential part of the meaning of 'compositional 
similarity', and this core concept will be referred 
to as 'monotonicity' with respect to unambigu¬ 
ously decreasing similarity. If a measure increases 
from one set of assemblages to the next in this 
sequence, it is not measuring similarity. Some com¬ 
monly used similarity measures fail this most basic 
test. The reverse of this test applies to differentiation 
measures. 

A measure of compositional similarity should 
not be sensitive to the raw abundances of the 
species in each assemblage, but only to their rel¬ 
ative abundances. This property may be called 
'density invariance'. A measure which lacks this 
property is not measuring compositional sim¬ 
ilarity, although it may have other legitimate 
interpretations. 

A third property essential in relative similar¬ 
ity and differentiation measures (those that range 
between 0 and 1) is 'replication invariance'. Ecol¬ 
ogists often compare the similarity of a subset of 
the data (perhaps a particular genus, or age class, 
or trophic guild) to the similarity of the whole 
dataset. For example, we may want to know if 
large-bodied moths are less spatially differentiated 
in a forest landscape than the moths as a whole. 
If these part-to-whole comparisons are to make 
sense, the measure of relative similarity or differ¬ 
entiation must be invariant with respect to pooling 
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of identical subsets. Suppose, for example, that we 
are studying large, medium, and small moths from 
two sites. Suppose each set of moths has exactly 
the same number of species and exactly the same 
abundances. Each set will show exactly the same 
degree of differentiation, no matter what measure 
is used. If ecologists made a part-to-whole com¬ 
parison, looking at the relative differentiation of 
small moths compared to that of moths in gen¬ 
eral, they would have to conclude that the small 
moths are not exceptional in their degree of differ¬ 
entiation. If we want a differentiation measure to 
lead to this sensible conclusion, the measure must 
have the property known in economics as 'repli¬ 
cation invariance'. If we have N identical subsets 
of abundances, and no species are shared between 
subsets, then the community as a whole should 
have the same degree of similarity as the individual 
subsets. Imagine what would happen if this were 
not the case. Taken to an extreme, every genus in 
the dataset might show the same high site-to-site 
relative differentiation, while the community as a 
whole could be assigned a low site-to-site relative 
differentiation. This actually happens with some 
common measures of similarity and differentiation. 
(Tables 6.1 and 6.2) 


6.2.1 Measures of relative compositional 
similarity and differentiation 

Incidence-based measures of relative similarity 
Incidence-based measures of relative compositional 
similarity are based only on the presence or absence 
of species. These have always been very popular 
with ecologists. The first incidence-based similarity 
index in ecology was published at the turn of the 
last century by Jaccard (1900, 1901), and new ones 
have been proposed continuously since then. Today, 
a bewildering number of incidence-based similar¬ 
ity measures are available to ecologists. Hubalek 
(1982) listed 43 and Koleff et al. (2003) listed 
24 incidence-type similarity indices. These papers, 
along with those of Gower (1985) and Legendre and 
Legendre (1998), provide comprehensive reviews 
of incidence-based indices. Table 6.1 presents some 
important incidence-based similarity measures that 
are monotonic with decreasing similarity, density- 
invariant, and replication-invariant. Both the origi¬ 
nal Jaccard measure and the classic Sorensen mea¬ 
sure (Sorensen 1948) are widely used. Both were 
originally designed to compare two assemblages. 
They differ only in their perspective. The Jac¬ 
card index compares the number of shared species 


Table 6.1 A class of incidence-based similarity indices for comparing two assemblages and their corresponding abundance-based 
versions. 


Index 

Incidence-based 

Incidence-based 

Abundance-based 


In terms of a, b, c 

In terms of Si, 52, S 12 

(See text for details) 

Jaccard 

a 

S, 2 

uv 

a+b+c 

5, +52-5,2 

u+v-uv 

Sorensen; Dice 

23 

(2 a+b+c) 

2 E 2 . 

5,+5; 

2UV 

J7+V 

Ochiai 

a 


(uv)'/ 2 

((a+b)(a+c)) l,; 

(5, S 2 ya 

Anderberg 

a 


uv 

3+2(6 + c) 

25,+25;-35,2 

2U+2V-3UV 

Kulczynski 

3 


UV 

b+c 

5,+52-25,2 

U+V-2UV 

Kulczynski; Cody 

a . a 

2(3+6) 2 (3+C) 


\(u + v) 

Lennon etal. (2001) 
(conditional Sorensen) 

a 


uv 

3+min (6,c) 

min (5,. 5j) 

min UW) 

General 

A function of a, b, c satisfying 

Replace a, b, c by Si 2 , 

Replace a, b, c by UV, 


some conditions (see text) 

5, -5,2,52-5,2 

u( 1 - v), v(] - u) 


This table is an expanded version of Table 2 of Chao et al. (2006). Symbols: a, the number of shared species; b, the number of unique species 
in the first assemblage; c, the number of unique species in the second assemblage; Si the number of species in Assemblage 1; S 2 the number 
of species in Assemblage 2; S 12 the number of shared species; U and V: see text. 
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Table 6.2 Some similarity indices for two and multiple assemblages and their properties (density invariance, replication invariance, and monotonicity, 
given in Section 6.2), 


Index 

Two-assemblage 

Multiple-assemblage 

Density Invariance (6.2) 

Replication invariance 
(6.2) 

Monotonicity 

(6.2) 

Incidence-based 

(1) Jaccard 

Jaccard (1900) 

Koch (1957) 

Yes 

Yes 

Yes 

(2) Sorensen 

Sorensen (1948) 

Diserud and 0degaard 
(2007) 

Yes 

Yes 

Yes 

Abundance-based 

(1) Horn index 

Horn (1966) 

Chao et al. (2008) 

Yes 

Yes 

Yes 

(2) Morisita-Horn 

index 

Morisita (1959) 

Jost (2006) 

Yes 

Yes 

Yes 

(3) Additive (based on 

Lande (1996) 

Lande (1996) 

Yes 

No 

No 

Gini-Simpson) 

(4) Additive (based on 

Lande (1996) 

Lande (1996) 

Yes 

No 

No 

entropy) 

(5) Bray—Curtis 

Bray—Curtis (1957) 

Use the average of 

No 

Yes 

Yes 

(6) Percentage 

Similarity 

Renkonen(1938) 

pairwise similarities or 
the average distance 

Yes 

Yes 

Yes 

(7) Standardized 

Gower/Euclidean/ 

Minkowski 

Gower (1971, 1985) 

from an individual 
assemblage to the 
centroid 

No 

Yes 

Yes 

measure 

(8) Canberra index 

Lance and Williams 

(1967) 


No 

No 

Yes 

(9) Correlation 

coefficient 

See, for example, Krebs 
(1999) 


Yes 

Yes 

No 

(10) Normalized 

expected species 

Grassle and Smith 

(1976) 

Chao et al. (2008) 

Yes 

No 

Yes 

shared (NESS) 

(11) Chao-Jaccard and 

Chao et al. (2005) 

Not available 

Yes 

Yes 

Yes 

Chao-Sorensen 


Formulas 

Assume that there are 5, species in the /th assemblage and S species in the combined assemblage. Let 5 denote the average number of species, M„ denote the 

abundance of the /'th species in the nth assemblage, and p, r denote the relative abundance, / = 1,2.S and r = 1, 2,.... N. Thus, we have N sets of species 

abundance {(Mu. M 2r __ Msr)', r = 1. N) and N sets of the relative abundance {(pi r , p 2r ,.... psr): r = 1, _ A/). 

Incidence-based indices 

(1) Incidence-based Jaccard for two assemblages = SnHS\ + S 2 - Su) 

Incidence-based Jaccard for multiple assemblages = (S/S - 1/W)/(1 - 1/W) 

(2) Incidence-based Sorensen for two assemblages = 25u/(5i + S 2 ) 

Incidence-based Sorensen for multiple assemblages = (A/ - S/5)/(W - 1) 

Abundance-based indices 

(11 Horn overlap index for two assemblages = equation 6.1 
Horn overlap index for N assemblages = equation 6.4 

(2) Morisita—Horn index for two assemblages = equation 6.2 
Morisita-Horn index for N assemblages = equation 6.5 

(3) Additive (based on Gini—Simpson index) for A/assemblages = H a /H y , where 

ns s N 

W a = (1//v)£(1 Hy = 1 -£>/ andp, =^p, 7 //V 

M <=i i-1 7-1 
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Table 6.2 Continued. 


(4) Additive (based on entropy) for Wassemblages = HJH y , where 

^ s n s 

H a = - — Pd lo 9<M i )• H r = - 2Z P l°g(p ( ) and 'p, is defined in (3) 

;= i y-i <=> 

(5) Bray—Curtis index for two assemblages = equation 6.6 

(6) Percentage similarity for two assemblages = equation 6.7 

(7) Gower/Euclidean/Minkowski similarity for two assemblages = equation 6.8 

(8) Canberra index for two assemblages = 1 - A £ -n ~~ r g 2 ' 

r=i 11 ,2 

(9) Correlation coefficient = correlation of the two sets of abundances (Mu, M 21 .Ms 1 ) and (M 12 , M 22 __ Ms 2 ). This measure takes value between 

-1 and!. 

(10) Normalized expected species shared (NESS(m)) for two assemblages: 

s 

2IwWbW 

_i=i_ 

£[|u,,(m)P + £( W2 <m)P 

where Wj (m) = 1 - (1 - p, 7 ) m . 

NESS(m) for /V assemblages: 

s 

£ £ W j(m>W(m) 

2 1=1 ;<* _ 

< w - ’) £?., £" , Uv(m)P 

(11) Chao-Jaccard for two assemblages = UV/(U + V - UV) 

Chao-Sorensen for two assemblages = 2UV/(U + V) 

where U(V) denotes the total relative abundance of the shared species in assemblage 1 (2). 


to the total number of species in the combined 
assemblages, while the Sorensen index compares 
the number of shared species to the mean num¬ 
ber of species in a single assemblage. The Jaccard 
index takes a global view while the Sorensen index 
takes a local view. Table 6.3 gives some exam¬ 
ples. The Sorensen index is a true overlap mea¬ 
sure (Wolda 1981): if the two assemblages have the 
same number of species, the Sorensen index gives 
the proportion of shared species in each assem¬ 
blage (i.e. the proportion of an assemblage's species 
list that overlaps with the species list of the other 
assemblage). 

The Sorensen index is the harmonic mean of the 
proportion of shared species in the first assemblage 
and the proportion of shared species in the sec¬ 
ond assemblage. The Ochiai index is the geometric 
mean and the Kulczynski-Cody index (Kulczynski 
1928) is the arithmetic mean of the two proportions. 
Because of the relationship between these means, it 


is always true that Sorensen index < Ochiai index < 
Kulczynski-Cody index. 

When the richness of one assemblage is much 
greater than the richness of the other, both the Jac¬ 
card and Sorensen indices are always small. This 
is an accurate reflection of the difference in species 
compositions of the two assemblages, but for some 
applications it can be useful to normalize the mea¬ 
sures so that they take the value unity when overlap 
is as large as it can be, given the respective rich¬ 
nesses of the two assemblages. Lennon et al. (2001) 
proposed this modification to the Sorensen index, 
which may be called the conditional Sorensen 
index. The formula for the conditional Sorensen 
index is given in Table 6.1. When there are no 
unique species in one of the assemblages, the condi¬ 
tional Sorensen index always yields a value of 1 no 
matter how many unique species there are in the 
other assemblage. Hence, this measure is not infor¬ 
mative when applied to two nested assemblages 
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Table 6.3 Artificial examples with four sets (A, B, C, D) of assemblages illustrating various similarity indices. 


Species 

Set A 

Assemblage 

1 2 

Set B 

Assemblage 

3 4 

SetC 

Assemblage 

5 6 

SetD 

Assemblage 

7 8 

9 

a 

0.125 

0.25 

0.6 

0.55 

0.8 

0.10 

0.75 0.20 

0.10 

b 

0.125 

0.25 

0.1 

0.05 

0,1 

0.55 

0.01 0.15 

0.20 

c 

0.125 

0.25 

0.1 

0.03 

0.1 

0 

0.02 0.05 

0.50 

d 

0.125 

0.25 

0.1 

0.02 

0 

0.33 

0.03 0 

0 

e 

0.125 

0 

0.05 

0 

0 

0.01 

0.19 0 

0 

f 

0.125 

0 

0.05 

0 

0 

0.01 

0 0.4 

0 

9 

0.125 

0 

0 

0.20 

0 

0 

0 0.2 

0 

h 

0.125 

0 

0 

0.15 

0 

0 

0 0 

0.08 

i 

0 

0 

0 

0 

0 

0 

0 0 

0.12 

Incidence based 









Jaccard 

0.50 


0.50 


0.33 


0.33 (See text) 


Sorensen (Con) 

0.67 


0.67 


0.50 


0.60 (See text) 


Lennon et al. 

1.00 


0.67 


0.67 


0.50 (Baselga et al. 2007) 

Abundance based 









Bray—Curtis 

0.33 


0.53 


0.20 


0.23 (pairwise mean) 

Percentage 

0.50 


0.65 


0.20 


0.22 (pairwise mean) 

Horn entropy (C 1N ) 

0.69 


0.73 


0.43 


0.41 (equation 6.4) 

Morisita-Horn (C 2 n) 

0.67 


0.89 


0.25 


0.27 (equation 6.5) 

Chao-Jaccard 

0.50 


0.61 


0.61 


Not available 


Chao-Sorensen 

0.67 


0.75 


0.75 


Not available 



For each species in the assemblage, the relative abundance is indicated. (For computing the Bray—Curtis index, the total 
number of individuals is assumed to be 100 in Assemblages 1, 3, 5, 7, and 9, and 200 for Assemblages 2,4, 6, and 8.) 


(see Set A of Table 6.3). See Legendre and Legendre 
(1998) and Koleff et al. (2003) for the interpretation 
of the other measures listed in Table 6.1. 

Most of these measures were originally designed 
to compare two assemblages. When more than 
two assemblages are compared, ecologists have 
often averaged their pairwise similarities (e.g. 
Lennon et al. 2001; Vellend 2001). However, the 
pairwise similarities tend to be correlated, which 
will cause an inference problem (Diserud & 0de- 
gaard 2007). Aside from the correlation prob¬ 
lem, pairwise similarities cannot fully characterize 
multiple-assemblage similarity when some species 
are shared across two, three, or more assem¬ 
blages. A simple example is given by Chao et al. 
(2008). These problems have motivated the search 
for multiple-assemblage generalizations of similar¬ 
ity measures that can take global similarity into 
account. 


Although there were measures in the literature 
that can be regarded as multiple-site Sorensen 
or related measures, a 'direct' extension of the 
Sorensen index to simultaneously compare multi¬ 
ple assemblages was first derived by Diserud & 
0degaard (2007). It has the form (N — S/S)/(N — 1), 
where N denotes the number of assemblages, S 
denotes the total number of species in the com¬ 
bined assemblage, and S denotes the average num¬ 
ber of species per assemblage. This measure is a 
monotonic transformation of Whittaker's (3 diver¬ 
sity (see below for details). It is identical to the com¬ 
plement of the 'turnover' measure by Harrison et al. 
(1992). For multiple assemblages, this generalized 
Sorensen index provides the overall proportion of 
shared species from a local perspective. Consider 
the special case in which each of the N assem¬ 
blages has S species. Suppose that exactly R species 
are shared by all assemblages, and the remaining 
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species are unique to their assemblage. This mea¬ 
sure then gives the proportion of each assemblage's 
species that overlap with the other assemblages, 
R/S The conditional Sorensen index has also been 
generalized to multiple assemblages (Baselga et al. 
2007). 

A multiple-site Jaccard index was presented 
by Koch (1957). It is expressed as (S/S — 1/N)/ 
(1 — 1/N), which measures the overall proportion 
of the shared species in the combined assem¬ 
blage. In the example of the preceding paragraph, 
this measure reduces to R /(total number species 
in the combined assemblage). Like the multiple- 
assemblage generalization of the Sorensen index, 
it is a monotonic transformation of Whittaker's 
multiplicative |3 diversity (Jost 2007). This mea¬ 
sure is less useful than the multiple-assemblage 
Sorensen index because it tends to decline as more 
assemblages are compared, even if overlap between 
assemblages is kept constant. 

In Table 6.3, Set D includes three assemblages 
with completely different non-shared species. There 
are nine species in the whole assemblage, and in 
each assemblage there are five species, three of 
which are shared by all assemblages. The pro¬ 
portion of shared species relative to the com¬ 
bined assemblages (multiple-assemblage Jaccard 
index) is 1:3, whereas the average proportion 
of shared species in each assemblage (multiple- 
assemblage Sorensen index) is 3:5. The values 
for the conditional Sorensen index are also given 
in Table 6.3. 

Each of these measures provides different 
information about compositional similarity. When 
choosing a measure, ecologists should think care¬ 
fully about whether their perspective is local 
or global. If it is local, the multiple-assemblage 
Sorensen index is an informative and widely used 
index, facilitating comparisons across studies. The 
conditional version of the Sorensen index gives 
useful additional information, so if diversities dif¬ 
fer greatly among assemblages this should also 
be presented. If the perspective is global, the 
multiple-assemblage Jaccard index and its condi¬ 
tional generalization could be used, but if the 
number of assemblages is high, both should be 
interpreted with caution (see above). When there 
are many assemblages, it is often dcfsier to convey an 


intuitive picture of similarity through the Sorensen 
index rather than the Jaccard index, regardless of 
whether the perspective is local or global. 

A more basic decision is whether or not to use 
incidence-based similarity measures at all. They are 
attractive because of the apparent ease of fieldwork, 
since only presence or absence needs to be noted. 
When nearly all species in each assemblage can be 
sampled, they are easily interpretable tools for com¬ 
paring species lists. In addition, incidence-based 
measures may be the only alternative available 
when it is hard to define or count individuals of 
each species, as in 'uncountable' microbes, colonies 
of insects, or plant and invertebrate taxa with clonal 
growth. 

In species-rich assemblages, however, intensive 
sampling efforts are needed to make complete or 
nearly complete lists of the species in a given 
assemblage. In practice, similarity measures must 
be estimated from sample data, so the species rich¬ 
ness and shared species richness in the formulas 
of Table 6.1 are replaced by the observed counts. 
It is well known (e.g. Wolda, 1981, 1983; Magur- 
ran, 2004, p. 175) that all incidence-based indices 
are biased when sampling is incomplete, and the 
biases are likely to be substantial for assemblages 
with high species richness and a large fraction of 
rare species. The Jaccard and Sorensen indices are 
often biased downwards (Chao et al. 2006), but 
could be biased upwards too. The biases exist even 
in the simplest case when all species are equally 
abundant. The bias cannot be reduced or removed 
by using equal sampling fractions, equal sample 
sizes, or equal effort. Not only the bias but also 
the variance depends on species abundances (not 
incidence alone); thus, it is impossible to correct for 
the bias or to estimate errors without using abun¬ 
dance data. Even when abundance data are avail¬ 
able, correcting the bias and assessing variance is 
not easy. See Chao et al. (2005, 2006) for relevant 
discussions. 

One may be inclined to estimate species richness 
and shared species richness using non-parametric, 
low-bias estimators (Chao 2005), plugging these 
into the formulas in Table 6.1. However, replac¬ 
ing observed values of species richness and shared 
species with non-parametric estimates is problem¬ 
atic for assessing assemblage similarity. Combining 



COMPOSITIONAL SIMILARITY AND |3 DIVERSITY 73 


these estimates in the formulas of Table 6.1 
unavoidably inflates the variance and often renders 
the resulting estimate useless. 

Finally, and perhaps most importantly, incidence- 
based similarity measures greatly oversimplify the 
relationships between assemblages. All such mea¬ 
sures regard a maple forest with a few scattered 
pine trees as identical in composition to a pine for¬ 
est with one or two maples, even though the two 
forests are ecologically very different. The relative 
abundances of species are ecologically important 
quantities. They should not be ignored, unless there 
is no choice. 

Abundance-based similarity and distance measures 
An ecologically more meaningful and informative 
approach to assessing similarity focuses on the dif¬ 
ferences in species frequencies among assemblages. 
This approach is also statistically more accurate 
and precise than incidence-based approaches. Some 
authors have adopted a parametric approach to 
abundance-based similarity measures by assum¬ 
ing that the relative abundances follows a para¬ 
metric distribution (e.g. Smith, Solow & Preston 
1996; Plotkin & Muller-Landau 2002). However, it 
is almost impossible to test whether a given sam¬ 
ple really came from a particular parametric dis¬ 
tribution (Chao 2005). We therefore focus on non- 
parametric indices that do not assume a particular 
kind of species abundance distribution. 

In all the formulas that follow, we assume 
there are N assemblages, with S total species 
in the combined assemblages Let M, r denote the 
absolute abundance of the z th species in the 
rth assemblage, and p, r denote its relative abun¬ 
dance, with i = 1,2,..., S and r = 1,2,..., N. Thus, 
we have N sets of species absolute abundances 
{(Mi,, M 2 ,,..., Msr); r = l, .... N } and N sets of 

the relative abundances {(p\ r , Pir . Psr); r = 1, 

. N] 




( 6 . 1 ) 


This index equals unity if and only if p n = p i2 for all 
i . When the two assemblages have equal numbers 
of species and consist entirely of equally common 
species, the Horn index is equal to the Sorensen 
index and gives the proportion of shared species in 
an assemblage. 

Another popular overlap measure is the Morisita 
Horn index, derived by Morisita (1959) and Horn 
(1966): 


Smh - 


2 EL P-1 P-2 
[EL Pi i + E,^ Pf 2 ] 


EL (Pn ~ Pn) 2 

EL ti + EL Pil 


( 6 . 2 ) 


When the two assemblages are equally diverse and 
consist entirely of equally common species, the 
Morisita-Horn index is equal to the Horn index and 
the Sorensen index, and all give the proportion of 
shared species in an assemblage. The right side of 
equation 6.2 shows that the Morisita-Horn measure 
is based on the squared differences of the relative 
abundances of each species in the two assemblages. 
Because of this squared distance, the Morisita-Horn 
index is dominated by the most abundant species, 
while the relatively rare species have little effect 
(even if there are many of them). This makes the 
measure resistant to under-sampling because the 
influential abundant species are always sampled 
relatively accurately. Since ecological processes are 
often most strongly influenced by the dominant 
species, this measure is a good measure when 
looking for functional differences between ecosys¬ 
tems. It may not be appropriate when rare species 
are important, as in conservation applications. The 
Horn overlap measure would be more useful in 
those applications. 


Horn and Morisita-Horn overlap measures 
The incidence-based Sorensen overlap index has 
abundance-based relatives. One of the most useful 
is the Horn (1966) overlap measure, based on Shan¬ 
non's entropy (Chapter 4). In its original form it 
compares two assemblages, species by species. To 
make this clearer, it can be expressed as 


CqN, a general multiple-assemblage abundance-based 
overlap measure 

Surprisingly, all the overlap measures we have 
discussed—the Sorensen index, the Horn index, 
and the Morisita-Horn index—are special cases 
of a single general multiple-assemblage overlap 
measure C,n (Chao et al. 2008): 
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XfWZN) t.[(Pn + Pi! + ■■■ + PiNf ~ (p?, + P? 2 + • • • Pin)] 


Cijhi = 


wE(P?i + P?2 + -P?n) 

i=l 


(6.3) 


Here g is a parameter that determines the measure's 
sensitivity to species' relative abundances, and N 
is the number of assemblages. When N = 2, setting 
q = 0 yields the Sorensen index, taking the limit as q 
approaches unity yields the Horn index, and setting 
q = 2 yields the Morisita-Horn index. Setting N > 2 
yields the corresponding multiple-assemblage gen¬ 
eralizations. The generalized Sorensen index is 
Con = (N — S/S)/(N — 1) and the generalized Horn 
index can be expressed as 


C,w - 


lc « N ir ir 


77 lo g 1 + E P*/P'7 

V k # ) 


(6.4) 


The multiple-assemblage Morisita-Horn index can 
be simplified as 


C 2 n - 


2 E E PijP.k 

i=l j<k 

(N- DEmEm Pii 


(6.5) 


For integer values of q between 2 and N, over¬ 
lap measures C,,n have a simple statistical interpre¬ 
tation as the ratio of two probabilities, ^Gd/’Gs- 
The numerator is the probability that q randomly 
sampled individuals belong to the same species, 
given that they did not all come from the same 
assemblage. The denominator is the probability 
that q randomly sampled individuals belong to 
the same species given that they are all drawn 
from the same assemblage (Chao et al. 2008). This 
interpretation shows that the value of q determines 
the 'depth' of the measure: when q = 2, only pair¬ 
wise similarity is considered, but when q = 3 the 
measure also takes into account species that are 
shared by three assemblages. This measure is den¬ 
sity and replication invariant, and monotonic with 
respect to unambiguously decreasing similarity. 
Like the multiple-assemblage generalizations of the 
Sorensen index, C, N gives the true overlap R/S 
for all orders of q when each assemblage consists 
of S equally common species, with R species con¬ 
tained in all assemblages and the remainder unique 


to single assemblages. In the section on using trans¬ 
formations of |3 to measure compositional similarity 
(below) we give an example of the use and interpre¬ 
tation of these measures. 

The expected value of C 2 n (the Morisita-Horn 
index and its multiple-assemblage generalization) 
can be predicted under Hubbell's (2001) neutral 
model of biodiversity, which makes it an impor¬ 
tant theoretical tool. In Hubbell's neutral model, the 
community structure depends only on the number 
of communities N, the number of individuals in 
each community n (all communities are assumed 
to have the same size), the migration rate between 
communities m, and the speciation rate v. After 
many generations it reaches an equilibrium, and the 
structure at equilibrium can be predicted from the 
model parameters. This model is mathematically 
identical to Wright's finite-island model in popu¬ 
lation genetics. The expected equilibrium value of 
the complement of the Morisita-Horn index has 
been derived for that model (Jost 2008); under the 
approximation that 1 » m » v, the formula for the 
expected equilibrium value of the Morisita-Horn 
index itself is Smh = m/[v(N — 1) + m). If similarity 
between assemblages is much different from this, 
some causal factors are implicated. C\n (the Horn 
measure of similarity and its multiple-assemblage 
generalization) can also be connected to this model 
under certain special conditions (Sherwin et al. 
2006), although a simple analytical expression in 
terms of model parameters still eludes us. 

Bray-Curtis and Renkonen similarity measures 
One of the most frequently used abundance-based 
similarity measures is the Bray-Curtis or 'quantita¬ 
tive Sorensen' index (Bray & Curtis 1957), devel¬ 
oped by Bray and Curtis during their pioneering 
work on plant community ordination. Beals (1984) 
gives a detailed review of the Bray-Curtis ordina¬ 
tion and related techniques. The Bray-Curtis simi¬ 
larity is expressed as 

„ 2E^min(M n ,M i2 ) „ Em |M n - M,- 2 | 

ORC = - c - — 1 — -F- 

Em (M/i + M i2 ) E/.1 (Mfi + M i2 ) 

( 6 . 6 ) 

where min(M,i, M 2 ) denotes the smaller of the two 
numbers. Mi or M 2 - This index reduces to the 
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Sorensen index if all species in each assemblage 
are equally abundant. The expression on the right 
in equation 6.6 shows that it takes the maximum 
value of 1 if and only if the two sets of absolute 
abundances are identical, that is, Mj, = Mu for all 
species. This shows that the index confounds den¬ 
sity with compositional similarity, so it cannot be 
considered a measure of compositional similarity. 
It approaches 0 when sample sizes are very differ¬ 
ent, whether the assemblages have the same com¬ 
positions or completely different compositions. For 
example, suppose every species in a young sec¬ 
ondary forest has exactly the same relative abun¬ 
dance as in a primary forest. Suppose the secondary 
forest has four times as many stems per hectare 
as the primary forest. Then the Bray-Curtis index 
will be approximately 1-| = 0.4 rather than unity. 
If the secondary forest had 10 times the density of 
the primary forest, the Bray-Curtis index would be 
about 0.18 instead of unity. If it had 100 times the 
density of the primary forest, the Bray-Curtis index 
would be close to 0.02. 

In addition to these conceptual problems, the 
Bray-Curtis index also has some statistical prob¬ 
lems when applied to samples. The observed 
absolute abundance for any species depends on 
the sampling fraction (the ratio of sample size to 
the total number of individuals in the assemblage), 
so this index becomes meaningless and performs 
erratically when the sampling fraction is unequal 
in the two assemblages (Chao et al. 2006). It also 
generally has a very large bias in this case (Chao 
et al. 2006). For these reasons, from both statistical 
and conceptual perspectives, the Bray-Curtis index 
cannot be recommended unless sampling fractions 
are known to be equal. Given the unlikely prospect 
of establishing such conditions for field data, the 
Bray-Curtis index seems rarely to be an acceptable 
choice for such data. Note that equalizing the num¬ 
ber of individuals (sample sizes) in all samples by rar¬ 
efaction before calculating the Bray-Curtis index, as 
suggested by Horner-Devine et al. (2004), does not 
equalize sampling fractions unless the assemblages 
themselves can be reasonably assumed to have the 
same total number of individuals susceptible to 
sampling. 

It follows from equation 6.6 that the Bray- 
Curtis measure can be interpreted as a normalized 


Manhattan distance (absolute difference). When 
there are multiple assemblages, as in the conven¬ 
tional distance-based approach, an average of pair¬ 
wise Bray-Curtis values is generally used as an 
overall similarity measure. This approach, unfortu¬ 
nately, ignores information about the species shared 
among three or more assemblages (Chao et al. 
2008). 

Renkonen (1938) proposed the following mea¬ 
sure of similarity: 

s 1 ^—> s 

Sp = 2^. =1 min (Pn, Pa) = 1 - 2 Z ^/4 , 

(6.7) 

This measure is based on the Manhattan distance of 
the relative abundances of the species in the assem¬ 
blages, instead of the absolute abundances as in the 
Bray-Curtis index. It is therefore density invariant, 
*" so it is a valid measure of compositional similarity. 
It attains a minimum value of 0 when two assem¬ 
blages are completely distinct (no shared species) 
and it attains a maximum value of 1 if and only 
if the two sets of relative abundances are identi¬ 
cal. Following the work of Renkonen (1938), Whit¬ 
taker (1952,1972) and Wolda (1981), this index has 
become one of the most commonly used measures 
among ecologists. Gregorius (1987) also suggested 
its use for a proper genetic distance among popula¬ 
tions. 

Smith and Zaret (1982) found that in their simula¬ 
tion trials this measure tends to be biased when esti¬ 
mated from sample data, but Wolda (1981) recom¬ 
mended its use if a logarithm transform is applied 
to species frequencies to reduce the dominance of 
the most abundant species. This measure suffers 
from the same disadvantage as the Bray-Curtis 
index, namely that the analysis for multiple sites 
can only be based on pairwise comparisons, and 
thus information about species shared between 
three or more sites is ignored. 

Distance measures 

We have seen that the two-assemblage Bray-Curtis 
and Renkonen similarity indices are complements 
of some types of normalized Manhattan distance, 
and the two-assemblage Morisita-Horn index is 
the complement of a type of normalized Euclid¬ 
ean distance. This use of normalized distance 
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between assemblages is a natural way to form 
an index of similarity or differentiation between 
two sets of quantitative variables. There are many 
distances, and some common approaches include 
Euclidean distance, Manhattan distance, Lance and 
Williams' (1967) Canberra distance, chi-square dis¬ 
tance, Minkowski distance, Orloci's chord distance, 
and Hellinger distance (Legendre & Legendre 1998; 
Gower 1985). As indicated by Gregorius (1996), a 
distinction between 'distance' measures and 'dif¬ 
ferentiation' measures is that 'distance' refers to a 
pair of assemblages and obeys the triangle inequal¬ 
ity whereas 'differentiation' refers to any number 
of assemblages but need not satisfy the triangle 
inequality. A unified class of distance-based simi¬ 
larity measures is the one-complement of the nor¬ 
malized Minkowski distance: 


where p > 1, and R, denotes the range (i.e. the dif¬ 
ference of the maximum and minimum) of the ith 
species abundances. The special case p = 1 corre¬ 
sponds to the Gower (1971) similarity index. The 
special case p = 2 corresponds to the similarity 
index based on the Euclidean distance. 

Due to the restriction that 'distance' is only cal¬ 
culated for a pair of assemblages, this approach 
cannot be directly extended to the case of multiple 
assemblages. Thus, the information shared by at 
least two assemblages is ignored in the traditional 
approach using the average of pairwise distances 
as the overall dissimilarity. Anderson et al. (2006) 
modified this approach by considering a multivari¬ 
ate dispersion as the '|3' diversity or a dissimilar¬ 
ity measure. A multivariate dispersion is defined 
as the average of the distance from an individual 
assemblage to the 'centroid.' This approach does 
simultaneously compare the whole set of assem¬ 
blages because the information shared by multiple 
assemblages can be incorporated in the 'centroid'. 


Indices based on total abundance of shared species 
The abundance-based indices discussed above 
match species relative abundances, species by species. 
That is, the typical similarity indices assess a nor¬ 
malized probability that two randomly chosen indi¬ 
viduals, one from each assemblage, belong to the 
same species. Chao et al. (2005, 2006) derived a 


class of measures which look at a different kind 
of similarity, based on assessing the normalized 
probability that two randomly chosen individu¬ 
als, one from each assemblage, belong to shared 
species (not necessarily the same species). It is sim¬ 
ple to convert any incidence-based measure that is 
replication invariant to its shared-abundance ver¬ 
sion. Let U denote the total relative abundances 
associated with the shared species in Assemblage 
1 and let V denote the total relative abundances 
of the shared species in Assemblage 2. By replac¬ 
ing a, b and c in Table 6.1 with UV, tf(l - V) 
and V(l — U), we obtain for each incidence-based 
index its shared-abundance version, given in the 
last column of Table 6.1. These indices attain 
the maximum value of 1 if and only if U = V = 1 
(even if the two sets of abundances may be dif¬ 
ferent). These two shared-abundance indices are 
called the Chao-Jaccard and Chao-Sorensen abun¬ 
dance indices in the literature and software pack¬ 
ages (e.g. Estimates by Colwell (2006) and SONS by 
Schloss and Handelsman (2006a)). 

Unlike the ordinary similarity indices that match 
species-by-species abundances, these measures 
match the total relative abundances of species 
shared between two assemblages. In Sets B and C 
considered in Table 6.3, the total shared species 
abundances are the same for the first assemblage 
in each set (U = 0.9) and also the same for the 
second assemblage in each set (1/ = 0.65), yielding 
the same Chao-Sorensen and Chao-Jaccard shared 
abundance indices, even though the patterns of 
abundance for species a and b are drastically 
different for the two sets. Set C in particular shows 
that these two indices are quite different from 
species-by-species abundance-based similarity 
measures. 

One main advantage of these measures is that the 
under-sampling bias due to unseen shared species 
can be evaluated and corrected. This class of mea¬ 
sures can also be extended to deal with repli¬ 
cated incidence data. The extension of this approach 
to assessing similarity of multiple assemblages, 
although conceptually simple, is statistically com¬ 
plicated and is still under development. These mea¬ 
sures are designed to be sensitive to rare shared 
species while still taking abundance into account, so 
they may make large jumps as more shared species 
are discovered. They should be used only if their 
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concept of similarity is relevant to the question of 
interest. For examples of their use and interpreta¬ 
tion, see Chao et al. (2005, 2006), Anderson et al. 
(2006) and Schloss and Handelsman (2006a). 

6.2.2 Diversity and compositional similarity 

There is an intimate connection between species 
diversity and compositional differentiation and 
similarity. If assemblages are identical in com¬ 
position, pooling them will leave their diversity 
unchanged. If assemblages are very different in 
composition, pooling them together with equal 
weights should result in a total diversity consid¬ 
erably greater than the diversity of any of the 
assemblages individually. Comparing the diversi¬ 
ties of the individual assemblages to the diversity 
of the pooled assemblages will provide informa¬ 
tion about their compositional similarity, as long 
as the pooling gives equal weight to each assem¬ 
blage. Many of the similarity and differentiation 
measures discussed above are based on different 
ways of comparing the diversities of the individ¬ 
ual assemblages to the diversity of the pooled 
assemblages. 

Similarity ratio based on diversity measures 
Ecologists often use the ratio of mean within-group 
diversity (the'« diversity') to total pooled diversity 
('y diversity') as a measure of the compositional 
similarity of the groups (Lande 1996). If this ratio 
is close to unity, ecologists infer that the groups 
must be similar in composition. If the ratio is low, 
they infer that the groups are highly differentiated 
in composition. 

While this reasoning seems logical, its validity 
actually depends on the measure used. When the 
most common measures of complexity are used, 
such as Shannon entropy (see Chapters 3 and 5) 

s 

Hs = -£ Pi lo g Pi 

i =1 

or the Gini-Simpson index 

Hgs = i - £ v 2 

;=i 

the ratio of mean within-group 'diversity' to total 
pooled 'diversity' always approaches unity when 


within-group diversity is high (MacArthur 1965; 
Whittaker 1972; Jost 2006, 2007; Tuomisto 2010) 
even if all the groups are completely distinct (no 
species in common). This problem arises frequently 
in practice and can lead to serious misinterpre¬ 
tations (Jost et al. 2010). These 'similarity' mea¬ 
sures based on the Gini-Simpson index or Shan¬ 
non entropy are also not monotonic with respect 
to unambiguously increasing similarity, so they do 
not rank datasets correctly in terms of their simi¬ 
larity (Jost et al. 2010). They should not be used to 
infer compositional similarity across space or time. 
Because of their non-linearity with respect to pool¬ 
ing of equally diverse groups, they are also danger¬ 
ously misleading in conservation applications (Jost 
2009). - 

The great mathematical ecologist Robert 
MacArthur was the first to notice the problems 
with computing similarity ratios based directly 
on Shannon entropy or the Gini-Simpson index 
(MacArthur 1965). He solved the problem by 
converting these measures to effective number 
of species. The effective number of species is 
the number of equally common species needed 
to produce the observed value of an index. For 
example, to find the effective number of species 
for a Shannon entropy of 2.4 (assuming this value 
of 2.4 was calculated using natural logarithms), 
we need to find out how many equally common 
species are needed to obtain a Shannon entropy of 
2.4. The answer is 11 species. Shannon entropy is 
converted to effective number of species by taking 
its exponential: 

1 D = exp(Hs) = exp £ p, log p)j 

= 'exponential of Shannon entropy' 

while the Gini-Simpson index or heterozygosity is 
converted by the following formula: 

s 

2 D = l/(l-Hcs) = l/£p , 2 

i=l 

= 'inverse Simpson concentration' 

MacArthur (1965) observed that when the numer¬ 
ator and denominator of the 'similarity' ratio 
described in the preceding paragraphs is converted 
to effective number of species, the ratio truly did 
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reflect the compositional similarity of the assem¬ 
blages, as long as both the within-group mean 
and the pooled diversity are calculated using equal 
weights for each assemblage. Pooling with equal 
weights means not pooling samples directly but 
pooling them in equal proportions. 

The particular mathematical property which 
makes some measures behave properly in similarity 
ratios while others fail is known in ecology as the 
doubling property or in economics as the replica¬ 
tion principle (Dalton 1920; Hannah and Kay 1977). 
Suppose we make N equally large copies of an 
ecosystem, each with identical species abundance 
distributions but with completely different species. 
The replication principle states that the diversity of 
the N pooled ecosystems (which share no species) 
must be N times the diversity of the original ecosys¬ 
tem. This replication principle seems to be implicit 
in many forms of reasoning about diversity (Hill 
1973; Jost 2009). (Replication invariance, discussed 
above, refers to measures that do not change when 
assemblages are replicated N times.) 

Hill (1973) derived a parametric family of diver¬ 
sity measures that obey the replication principle: 



This formula gives the effective number of species 
of all standard complexity and diversity measures 
(Jost 2007). As discussed earlier, the value of q deter¬ 
mines the sensitivity of the measure to species rel¬ 
ative abundances; ecologists can either emphasize 
(q > 1) or de-emphasize (q < 1) the relative abun¬ 
dances. When q = 0, this formula gives species rich¬ 
ness. When q =2, it gives the inverse Simpson con¬ 
centration (Simpson 1949). When q = 1 (the value 
that weighs all species exactly by their frequency), 
the formula is undefined (because of a division by 
zero in the exponent), but the limit as q approaches 
unity exists and equals the exponential of Shan¬ 
non entropy. Shannon entropy thus can be derived 
directly from the mathematics of diversity and does 
not need to be borrowed from other fields. In order 
to avoid invalid inferences about diversity, these 
measures should always be used in preference to 
the raw Shannon entropy or Gini-Simpson index 
(MacArthur 1965; Whittaker 1972; Routledge 1979; 


Peet 1974; Jost 2006, 2007, 2009). Measures of com¬ 
positional complexity that obey the replication prin¬ 
ciple are the 'true diversities' of Jost (2007,2009) and 
Tuomisto (2010). 

When a complexity measure that obeys the repli¬ 
cation principle is used in the similarity ratio, with 
all assemblages weighted equally and with the 
mean defined in equation 6.9a below, the similar¬ 
ity ratio ranges from unity (complete similarity) to 
1/N (complete dissimilarity). This range does not 
depend on their diversities and is easily normalized 
onto the interval from 0 to 1. 

Partitioning diversity into a and |3 components 
As mentioned in the preceding section, a, diversity 
is a mean (not necessarily the arithmetic mean) of 
the diversities of a set of assemblages, whereas y 
diversity is the diversity of the pooled assemblages. 
y diversity is at least as great as a diversity, and 
will exceed a diversity when the assemblages are 
differentiated, y diversity therefore has two contri¬ 
butions, one from a diversity and the other from 
differentiation between assemblages. The contribu¬ 
tion of differentiation to the total or y diversity is 
called |3 diversity. 

What is the relationship between a, ]3, and 
y? Whittaker (1972) proposed that |3 be defined 
through a multiplicative relationship: 

a. x |3 = y 

He applied this relationship to effective numbers of 
species (species richness, the exponential of Shan¬ 
non entropy, and the inverse Simpson concentra¬ 
tion). He restricted his discussion to the case of 
equally weighted assemblages, which is the appro¬ 
priate choice when the goal is to measure composi¬ 
tional similarity. 

Lande (1996) proposed instead that [3 be defined 
through an additive relationship with a: 

a + |3 = y 

This additive approach is meant to apply to con¬ 
cave measures of diversity and complexity (see 
Lande (1996) for a definition of concave), such as 
species richness (see Chapter 4), Shannon entropy 
(see Chapters 3 and 5), and the Gini-Simpson index. 
In this approach, a is the weighted arithmetic 
mean of the diversity or complexity measure of the 
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individual assemblages. (3 is found by subtracting 
a from y. The concavity property ensures that 7 is 
never less than oc, so 3 defined in this way is always 
non-negative. This approach is superficially similar 
to the additive partitioning of variances. 

These approaches are often respectively called 
multiplicative partitioning and additive partition¬ 
ing of 7 into a and 3 components. However, 'parti¬ 
tioning' usually means a complete separation of the 
two components, oc (the within-group component) 
and (3 (the between-group component). A genuine 
partitioning of 7 into cc and (3 components would 
place all the within-group influence into the oc com¬ 
ponent, and all the between-group influence into 
the |3 component. If the partitioning is complete, 
the oc component provides no information about the 
value of the (3 component, and vice versa; they are 
completely independent components of 7 diversity 
or complexity measure. 

When a complete partitioning of 7 exists, its 
form is unique for any given measure. It exists 
for all standard diversity and complexity measures 
when assemblages are given equal weights. For 
species richness (q = 0 ), the complete partitioning 
is multiplicative: ax |3 = 7 . For Shannon entropy 
(q = 1), it is additive: oc + (3 = 7 . For the Gini- 
Simpson index, it is neither additive nor multiplica¬ 
tive: a + |3 — ax| 3 = 7 . For all effective number of 
species, or Hill numbers of any order, which are true 
diversities in the sense of Jost (2007), it is multiplica¬ 
tive: oc x (3 = 7 (Jost 2007). The components are: 
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(6.9b) 

(6.9c) 


|3 for true diversities is just the reciprocal of the 
similarity ratio of the previous section. It ranges 
from 1 to N, and gives the effective number of com¬ 
pletely distinct assemblages; it is a divergence mea¬ 
sure between the species probability distributions 
of the assemblages. It is a 'true 3 diversity' in the 
senses of Jost (2007) and Tuomisto (2010). In the 
special case of q =0, equation 6.9c reduces to Whit¬ 


taker's p diversity S/S based on species richness. 
Investigators using these measures should at least 
report the values of (3 for q = 0,1, and 2, in order to 
give a picture of how differentiation varies between 
all species (q = 0 ), the typical species (q = 1 ), and the 
dominant species (q =2). A more complete picture 
is conveyed by a graph of 3 diversity from q = 0 to 
q = 4 or 5 (beyond this it usually does not change 
much). This is the 3 diversity profile. This can be 
easier to interpret when normalized onto the unit 
interval to give a similarity profile, as described in 
the next section (see Box 6.1 for an example). 

When assemblage area or size needs to be taken 
into account, as when measuring regional hetero¬ 
geneity (Horn 1966) instead of compositional dif¬ 
ferentiation, a complete partitioning into a mea¬ 
sure of mean within-site diversity and a measure 
of between-site differentiation only exists for mea¬ 
sures based on Shannon entropy and species rich¬ 
ness (Jost 2007). For the exponential of Shannon 
entropy, the components are: 


s s 

1 D„ = exp ( -101 Y2 Vi 1 lo g Vi 1 Pi 2 log Pi 2 ■ ■ ■ 


““^E Pin log Pn 
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1 D y = exp 
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( 6 . 10 b) 


1 Da = 1 Dy/ 1 D a (6.10c) 

Here the weights uq are usually taken to be the 
relative sizes (either in terms of area or total pop¬ 
ulation) of the assemblages. This 3 ranges from 1 
to the weight diversity, which is the exponential 
of the Shannon entropy of the weights themselves. 
It takes the latter value when the assemblages 
are completely different in composition (no shared 
species). When other measures of diversity are 
used with unequal weights, it may still be pos¬ 
sible to extract information about assemblage dif¬ 
ferentiation by comparing them to more compli¬ 
cated reference diversities; see Gregorius (2010) for 
discussion. 
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Box 6.1 Numerical example: assessing the compositional similarity of trees and seedlings 
across six sites 


We illustrate our multiple-assemblage similarity indices by 
using the frequency data collected in six forests in 
north-eastern Costa Rica 2006 by Chazdon and colleagues; 
see Norden et al. (2009) for details of the six sites. The six 
1-ha plots vary in land-use history and in protection from 
hunting. Two plots are young secondary forests: El Bejuco 
(EB, 12 years old in 2006) and Juan Enriquez (JE, 12 years 
old in 2006). Two plots are 'intermediate' secondary 
forests: Lindero Sur (LSUR, 21 years old in 2006) and 
Lindero El Peje secondary (LEPS, 29 years old in 2006). Two 
plots are in old growth forests: Lindero El Peje primary 
(LEPP, over 200 years) and Selva Verde (SV, over 200 
years). The four secondary forests were all cleared, burned, 
and used for pasture for several years and were 
subsequently abandoned. Three sites are located within La 
Selva Biological Station (LSUR, LEPS, and LEPP), and three 
sites are located 7 km to the west in Chilamate. 

Each plot was 50 x 200 m (1 ha). For 
this example, we consider two size classes of canopy tree 
species: trees (> 5 cm in DBH) including canopy palms, 
and canopy tree seedlings (< 1 cm in DBH, but > 20 cm in 
height). All trees were marked and measured for diameter 
within a 1-ha plot in each forest. Seedlings were sampled 
in five strips 2 x 200 m long, running every 10 m (0.2 ha). 
The number of species and individuals are summarized 
in Table 6.4. Note that SV old-growth forest has by far the 
highest seedling abundance; 72% are seedlings of a single 
canopy palm species, Welfia regia. Species richness of trees 
is significantly greater in the old-growth plots, but seedling 
species richness is similar (Norden et al. 2009; Table 6.4). 


Table 6.4 The observed species richness in four secondary-growth and 
two old-growth sites (numbers in parentheses denote the number of indi¬ 
viduals) 


Site 

Age in 2006 

Seedlings 

Trees 

EB young 

12 

65 (3797) 

56 (498) 

JE young 

12 

58 (2258) 

48 (546) 

LSUR intermediate 

21 

42 (777) 

51 (762) 

LEPS intermediate 

29 

47 (1303) 

79(1093) 

LEPP old-growth 

> 200 

53 (1160) 

89(618) 

SV old-growth 

> 200 

54 (5411) 

114 (832) 

Average number of species 


53.17 

72.83 

Number of species in the six plots 


97 

184 


See Box 6.1 for explanation of EB, JE, LSUR, LEPS, LEPP, and SV. 


Table 6.5 ft diversities and multiple-assemblage similarity indices 
(with s.e. in parenthesis) across the six sites for trees and seedlings. 


Index 

Sizes 

<7 = 0 

q = i 

<7=2 



(Presence/ 

(Entropy 

(Simpson 



absence) 

based) 

index based) 

p diversity 

Trees 

2.526(0.052) 

1.982 (0.026) 

1.626 (0.037) 


Seedlings 

1.824 (0.034) 

1.566 (0.011) 

1.797 (0.019) 

Multiple- 

assemblage 

similarity 

Trees 

0.695 (0.009) 

0.618(0.007) 

0.535 (0.015) 

CqN 

Seedlings 

0.835 (0.007) 

0.750 (0.004) 

0.467 (0.008) 


The 3 diversity formula is given in equation 6.9c and multiple-assemblage 
similarity in equation 6.3 or 6.12. 


Although a specialized group of tree species colonize 
abandoned pastures and become dominant in early 
secondary forests, seedling species are more likely to be 
shared with old-growth forests, due to similar conditions 
within the shaded forest understory in all plots. We 
therefore predicted that multiple-assemblage similarity 
would be lower for trees than for seedlings across the six 
forest plots. 

Here, we assess the compositional similarity indices of 
trees across the six plots. We then compare the similarity 
indices for seedlings across the six plots. In Table 6.5 we 
present the |3 diversities and multiple-assemblage similarity 
indices of orders 0,1, and 2 along with their bootstrapped 
s.e. for trees and seedlings. We use the observed species 
and abundance to compute the measures for q = 0 and 
1 because for these data the under-sampling bias is 
relatively low so that it would not affect the general 
'relative' pattern. For the measures of q = 2, we use the 
nearly unbiased estimators (Chao et al. 2008). 

Based on Table 6.5 and the multiple-assemblage 
similarity profile in Figure 6.1A, for q = 0 and 1, the 
cross-site similarity of seedlings is larger than the cross-site 
similarity of trees. However for q > 2, the result is 
reversed. This is because abundant species strongly 
influence the assemblage similarity index for q >2. This 
pattern is due to a single species that is superabundant in 
the SV old-growth forest, Welfia regia. When this species is 
excluded in the comparison, it is clear (Figure 6.IB) the 
cross-site similarity of seedlings is always larger than the 
cross-site similarity of trees for all orders. The above 
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(a) Figure la: for all data (b) Figure lb: for data excluding species Welfia regia 



Figure 6.1 The multiple-assemblage similarity profile C qN (for q = 0 - 6) and the associated 95% confidence interval: x-axis, order q; y-axis, 
similarity index C q ^. (a) For all data; (b) for data excluding species Welfia regia. 


pattern is also generally valid for any pair of plots for the 
three orders. 

The results show that, among these six plots, p diversity 
of trees is higher than for tree seedlings. This pattern 
reflects the ecological specialization of common trees in 
secondary vs old-growth forests. In contrast, tree seedlings 
established in both secondary and old-growth forest are 
more evenly mixed across the plots and do not show 
evidence of ecological segregation or differentiation. In the 
secondary forests, many of these seedlings are colonizing 
these forests for the first time (Norden et al. 2009). 
Gradually, these changes in species composition will lead to 

The complete partitioning lets us evaluate the 
multiplicative and additive approaches. Whit¬ 
taker's multiplicative approach gives the complete 
partitioning of all effective numbers of species. The 
additive approach gives the complete partitioning 
of Shannon entropy but not species richness or the 
Gini-Simpson index. The additive species richness 
P is still a useful measure, as shown below, but the 
additive Gini-Simpson |3 is confounded with a in a 
particularly inappropriate way. When a is high, this 
|3 necessarily approaches zero, even if all groups 
are completely differentiated (Jost 2006, 2007, 2008; 
Jost et al. 2010; Tuomisto 2010). One consequence of 
this is that additive |3 using the Gini-Simpson index 
is not monotonic with respect to unambiguously 


differentiation and reassembly of tree species composition 
as the shade-tolerant species become recruited in larger 
size classes (Chao et al. 2005). Since the under-sampling 
bias for the estimates of the measure of order 0 (only 
presence/absence data are used) generally cannot be 
corrected and a measure based on abundance data is more 
informative, it is suggested that the multiple-assemblage 
similarity index, particularly for measures of order q = 1 
and q = 2 based on abundance data, provides a sensitive 
and statistically robust metric for comparing compositional 
similarity of tree and seedling assemblages in forests at 
different stages of succession (Chao et al. 2008). 

increasing differentiation (Jost et al. 2010). It cannot 
be directly interpreted as a measure of differenti¬ 
ation for a given set of assemblages. It needs to 
be properly normalized before it can be interpreted 
(Jost et al. 2010). Hierarchical additive partitioning 
of this measure gives misleading results as well, 
since <x generally increases at higher hierarchical 
levels, limiting this |3 to ever-smaller values as we 
go up the hierarchical pyramid. 

Using transformations of (3 to measure compositional 
similarity 

|3 diversity based on complete partitioning is inde¬ 
pendent of a. However, we may also be interested 
in measures of the number of species (or effective 
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number of species) by which the assemblages dif¬ 
fer; this kind of measure necessarily depends on a. 
For example, if assemblage weights are taken to be 
equal, the effective number of regional species not 
contained in a typical local assemblage is given by 
i D y — i D a = i D a ('i Dp — 1). This is essentially addi¬ 
tive partitioning of effective number of species (but 
taking care to use equal weights when q is differ¬ 
ent from 0 or 1, and making sure that q D„ is cal¬ 
culated as above). Economo and Kiett (2008) used 
this partitioning to investigate the behaviour of 
Hubbell's neutral theory of biodiversity. Dividing 
this by N — 1 gives the effective number of species 
unique to a typical local assemblage. This is given 
by q D a (‘i Dp — l)/(N - 1). These measures should 
be interpreted with care when comparing regions, 
since they depend on a diversity. 

The |3 diversities for effective number of species 
ranges from 1 to N when weights are equal. They 
can be made into measures of relative composi¬ 
tional similarity by transforming them onto the 
unit interval. All such transformations inherit the 
important mathematical properties of this |3 (inde¬ 
pendence from a, density invariance, replication 
invariance, and others). Similarity measures that 
are transformations of |3 are not limited to two 
assemblages. There are many possible transforma¬ 
tions of (3 onto the unit interval, each address¬ 
ing a different aspect of compositional similar¬ 
ity (Jost 2007; Chao et al. 2008; Tuomisto 2010). 
A transformation that is linear in the propor¬ 
tion of regional diversity contained in the average 
assemblage is 

q S = (l/ 9 Dj9 — 1/N)/(1 — 1/N) (6.11) 

(Jost 2007). When q = 0, this is the multiple- 
assemblage generalization of the Jaccard index. 
When q = 2 this is the multiple-community general¬ 
ization of the Morisita-Horn index. The Jaccard and 
Morisita-Hom measures are members of the same 
family and are connected by a continuum of similar¬ 
ity measures which differ only in their sensitivity to 
species relative abundances. 

The overlap measure C,n discussed earlier is also 
a transformation of true (3 diversity: 

C, N = [(1/" D^- 1 - (1/N)‘ ?-1 ]/[1 - (1 /NT 1 ] 

- (6.12) 


This family of measures includes the multiple- 
assemblage generalizations of the Sorensen index, 
Horn index, and Morisita-Horn index. All three 
of these standard indices are measuring the same 
aspect of assemblage similarity, only differing in 
the weights they give to species frequencies. The 
Morisita-Hom index is unique in belonging to both 
families of similarity measures; it is the both lin¬ 
ear in shared diversity and linear in the degree of 
overlap. It therefore has a wider range of interpre¬ 
tations than other similarity measures. It can be 
useful to transform the 3 in equation 6.10c to a 
normalized measure between 0 and 1, so that we 
can compare compositional similarity across sev¬ 
eral sets of assemblages. This transformation gener¬ 
alizes Horn's overlap measure (equation 6.4) to the 
unequal-weight case: 
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We recommend that investigators calculate at least 
Con, Clv, and C 2 ,v for their data (Box 6.1). They 
could also use equation 6.12 to graph a continu¬ 
ous similarity profile for a range of q (say, from 
q = 0 to q = N). As an example of the use of C l]N , 
consider the assemblage sets A-D in Table 6.3. In 
Set A, all species in each assemblage are equally 
abundant, thus the three similarity indices are about 
the same Con = C 2 n ^ Cjn- In Set B, C 2 n > Cjn, 
implying that the dominant species are more sim¬ 
ilar than the average species. In contrast, for Sets 
C and D, C 2 n < Cin, implying that the dominant 
species are less similar than the average species. 
C 2 n, like the Simpson index it is based on, is more 
sensitive to the dominant species than Cin- We can 
make comparisons across the four sets because all 
measures are normalized to the range of 0 and 1. 
The three similarity indices in both Sets A and B 
are consistently higher than those in Sets C and 
D. Based on the Horn index. Set D has the low¬ 
est similarity. The proposed indices Con, Cin, and 
C 2 n are featured in the Program SPADE (Chao & 
Shen 2003a), which can be freely downloaded from 
http://chao.stat.nthu.edu.tw/softwareCE.html. 
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6.2.3 Statistical estimation of assemblage 
differentiation and similarity 

In practice, all differentiation and similarity mea¬ 
sures need to be estimated from a relatively small 
sample taken from each of the study assemblages. 
Traditionally, species frequencies in the samples 
are assumed to give good estimates of the species 
frequencies in the assemblage itself. The sample 
species frequencies are therefore substituted for 
the assemblage species frequencies in the theoreti¬ 
cal formulas for differentiation and similarity. The 
resulting estimate is called the maximum likeli¬ 
hood estimator (MLE). This approach is statisti¬ 
cally valid only when sample size is relatively large 
compared to the number of species, so that almost 
all species are observed in samples. When sample 
size is relatively small, so that a large fraction of 
rare species are missed in samples, our simulations 
show that the MLEs for differentiation indices are 
generally biased upwards and thus biased down¬ 
wards for similarity measures. Sampling limitations 
create challenges for making accurate estimates 
of differentiation and similarity measures. More¬ 
over, any bias-reduction method valid for estimat¬ 
ing a diversity in one assemblage cannot be directly 
extended to estimate y diversity. This is because the 
abundance vectors cannot be simply pooled across 
assemblages unless the sampling efforts are equal in 
all assemblages, so that sample sizes reflect assem¬ 
blage sizes. 

The magnitude of the under-sampling bias 
depends on the diversity order q. For q > 2, a nearly 
unbiased estimator exists for any measure based 
on the Simpson index. Chao et al. (2008) provided 
such an estimator for the index C 9 ^. Similar estima¬ 
tors can be constructed for (3 diversity or differen¬ 
tiation measures. For q = 1, there exists no nearly 
unbiased estimator for any measure based on Shan¬ 
non's entropy. The unseen species have moder¬ 
ate effect on the index. A two-sample jackknife 
method (Schechtman & Wang 2004) that removes 
most of the under-sampling bias of the MLE has 
been applied to Horn's heterogeneity and overlap 
indices (Norden et al. 2009; Gofiner et al. 2009). A 
coverage-based Horvitz-Thompson approach, sim¬ 
ilar to that proposed by (Chao and Shen 2003b) for 
estimating entropy in one assemblage, is currently 


under development for multiple-assemblage mea¬ 
sures. For q = 0, the under-sampling bias could 
be substantial, especially for highly diverse assem¬ 
blages. Only an accurate lower bound for species 
richness (or shared species richness) is feasible 
(Chao 2005; Pan et al. 2009). 

Regardless of how the similarity or differen¬ 
tiation is estimated, the result will always have 
some statistical uncertainty. Ecologists traditionally 
quantify this uncertainty by testing their results 
against the null hypothesis that there is no differ¬ 
entiation, and reporting a P value calculated by 
randomized resampling. This is rarely an appro¬ 
priate approach, since the question of interest is 
not 'Is there any differentiation?'(there will always 
be some differentiation among natural populations) 
but rather 'How much differentiation is there?' The 
problem is one of parameter estimation rather than 
hypothesis testing. In parameter estimation, statis¬ 
tical uncertainty is expressed by means of a con¬ 
fidence interval. If sample size is sufficient, boot¬ 
strapping can be used to calculate an approximate 
confidence interval around the estimated value of 
the measure (Chao et al. 2008). If the confidence 
interval includes 0 (for relative differentiation) or 
1 (for relative similarity), then the observations are 
insufficient to reject the null hypothesis of zero dif¬ 
ferentiation. If the interval does not include 0, not 
only can the investigator reject the null hypothesis 
but he or she will also know the range of magni¬ 
tudes within which the true population value of 
the measure lies. This magnitude is the scientifically 
important aspect of differentiation or similarity. The 
magnitude of good differentiation and similarity 
measures can be understood by referring to simpler 
reference assemblages with the same value of the 
measure. 


6.3 Prospectus 

Exciting advances in the mathematics of diver¬ 
sity and differentiation have been made within 
the fields of genetics, economics, information sci¬ 
ence, and physics, but these advances are largely 
ignored outside of the disciplines that generated 
the advance. In addition the extensive statistical 
literature on divergence measures (see Pardo 2006) 
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has yet to be tapped by ecologists. Interdisciplinary 
synthesis of these developments holds great poten¬ 
tial. It appears that all disciplines are independently 
converging on the same concepts, and we would 
get there faster if each science was aware of the 
progress made by the other sciences. Order 1 mea¬ 
sures (those based on Shannon entropy) have many 
important mathematical properties not shared by 
other frequency-sensitive diversity measures, and 
come closest to the ecologists' intuitive idea of 
diversity and differentiation (Jost 2007; Jost et al. 
2010). If ecologists or population geneticists can 
derive the expected behaviour of these measures 
under a neutral model, following the lead of Sher- 
win et al. (2006), these measures would become the 
standard in the field. 

Standard similarity measures treat all species as 
equally distinct. Yet in many important ways, an 
assemblage of rats and an assemblage of mice are 
more similar than an assemblage of rats and an 
assemblage of whales. Some recently developed 
measures take into account functional, phyloge¬ 
netic, or other kinds of species differences (Graham 
and Fine 2008; Allen et al. 2009; Cadotte et al. 2009; 
Pavoine et al. 2009). However, these measures are 
usually based on additive partitioning of a general¬ 
ization of the Gini-Simpson index (e.g. Hardy and 
Senterre 2007). The resulting differentiation mea¬ 
sure is not acceptable (Table 6.2, see also Jost (2008) 
and Hardy and Jost (2008)). Ricotta and Szeidl 
(2009) have recently extended the diversity of order 
2 to take into account phylogenetic or functional 
differences using multiplicative partitioning. It is 


important to extend this measure to other orders, 

and resolve issues relating to the definition of a 

diversity. 

6.4 Key points 

1. The general overlap measure C cj ^ is the best 
overall measure of relative compositional 
similarity. It includes the Sorensen, Horn, and 
Morisita-Horn indices and their multiple- 
assemblage generalizations, which take global 
similarity into account. 

2. The Morisita-Hom index is especially useful as 
an abundance-based similarity measure, since its 
expected value can be predicted from the para¬ 
meters of HubbelTs neutral model. 

3. Some common similarity and differentiation 
measures (traditional additive (3 and similarity 
measures, some distance measures, the Bray- 
Curtis measure, and between-group component 
of variance) do not behave as expected and do 
not necessarily reflect compositional similarity 
or differentiation. 

4. Assemblage differentiation is intimately related 
to the concept of (3 diversity. Multiplicative par¬ 
titioning of effective number of species best 
captures the notion of |3 diversity. It can be 
transformed into the Jaccard and C qN similarity 
indices, among others. 

5. Estimation of differentiation and similarity is 
statistically challenging, especially for measures 
based on species richness. Specialized estimators 
should be used. 



CHAPTER 7 


Measuring biological diversity 
in time (and space) 

Anne E. Magurran 


. .we forget that each species, even where it 
most abounds, is constantly suffering enormous 
destruction at some period of its life, from ene¬ 
mies or from competitors for the same place and 
food; and if these enemies or competitors be in 
the least degree favoured by any slight change 
of climate, they will increase in numbers; and as 
each area is already fully stocked with inhabi¬ 
tants, the other species must decrease.' 

—(Darwin 1859) 


7.1 Introduction 

Communities change through time as well as space. 
The knowledge that species abundances and identi¬ 
ties vary over time helped shape Darwin's thinking 
when he was developing his ideas on natural selec¬ 
tion. It also underpins key ecological concepts such 
as MacArthur and Wilson's theory of island bio¬ 
geography (MacArthur & Wilson 1967) and biodi¬ 
versity models including those proposed by Fisher 
(Fisher et al. 1943), Preston (1948) and Hubbell 
(2001). However, although temporal processes are 
indisputably important in ecology, temporal pat¬ 
terns of biodiversity have received much less atten¬ 
tion than spatial ones. In part this is because there 
are relatively few examples of long time series of 
biodiversity data. Indeed, in some cases long-term 
studies have been cancelled by funding agencies 
because they were deemed to be merely data collec¬ 
tion and not cutting-edge science. Events in recent 
years have proved this view to be short-sighted 
and one of the most pressing challenges that ecol¬ 
ogists face today is being able to detect the signa¬ 
ture of anthropogenic activities on biological diver¬ 


sity relative to background rates of change. Dan- 
win recognized that even small shifts in climate 
will modify community composition and structure, 
but in the absence of good baseline data it is dif¬ 
ficult to distinguish temporal changes in biodiver¬ 
sity that can be attributed to climate change from 
those that might have happened anyway. The lim¬ 
ited availability of this type of information is also 
one of the reasons why the ambitious goal of the 
2002 World Summit of Sustainable Development, 
to reduce the rate of biodiversity loss by 2010, has 
proved impossible to meet. A better appreciation 
of temporal patterns is equally relevant to con¬ 
servation managers since conservation targets are 
often set in relation to static benchmarks. Species 
abundances vary through time, and species will 
inevitably be lost from or gained by a community 
or nature reserve. Enlightened conservation plan¬ 
ning (e.g. Rodrigues et al. 2000) needs to recog¬ 
nize this and judge the success of management 
decisions against natural rates of turnover rather 
than treating species in the same way that curators 
view a collection of Van Gogh pictures in an art 
museum. 

The temporal changes that all communities expe¬ 
rience take one of two forms. First, communities 
may move from one state to another. This directional 
change can reflect an intrinsic event, for instance 
when the community undergoes succession. Alter¬ 
natively, extrinsic factors such as disturbance, pol¬ 
lution, invasive species or a catastrophe can lead to 
shifts in structure. Of course as Dornelas et al. point 
out in Chapter 17 it is not always easy to distinguish 
natural and human imposed disturbance. Second, 
all communities undergo non-directional change. This 
temporal turnover—the result of immigration and 
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local extinction, as well as variation through time in 
the abundance of the resident species—is linked to 
many phenomena that ecologists study, including 
competition, predation, and population dynamics. 
The status of a community, that is, whether it is 
undergoing directional as well as non-directional 
change, needs to be taken into account in any study 
that seeks to evaluate change in relation to natural 
baselines. 

It is belatedly being recognized that temporal 
patterns of diversity deserve much greater atten¬ 
tion and for this reason there have, until now, 
been relatively few attempts to develop appropri¬ 
ate methods. The literature emphasizes methods 
developed for spatial data and it is often assumed 
that approaches used to assess diversity in a spa¬ 
tial context can be applied, without modification, 
to temporal questions. In fact, although this is 
often fine, there are pitfalls for the unwary user, 
for example spatial comparisons usually examine 
different subsets of individuals whereas the same 
individuals may be sampled repeatedly if a single 
community is tracked through time. In what fol¬ 
lows I draw together information on methods that 
can be used to measure temporal patterns, provide 
guidelines on their application, and identify top¬ 
ics that would benefit from further development. 
The chapter begins by considering timescales of 
change and the spatial, temporal, and taxonomic 
boundaries of a community or unit of study. It then 
presents a brief overview of the different ways in 
which one can measure diversity, and how tem¬ 
poral measurements fit into this scheme. I dis¬ 
tinguish between methods that assess change in 
diversity over time—this can be change in either 
the amount of diversity or the composition of 
the assemblage—from those that combine tempo¬ 
ral information to glean insights into the struc¬ 
ture of the assemblage. The chapter reviews clas¬ 
sical measures of temporal turnover and shows 
that species time curves can be also be used to 
assess turnover. It discusses the equivalence of 
diversity patterns over space and time and briefly 
examines methods of partitioning of diversity into 
both spatial and temporal components. In addition, 
approaches that can be used to evaluate temporal 
change in species abundance distributions are dis¬ 
cussed. 


7.2 State of the field: timescales of 
change and community boundaries 

As Preston (1960) observed, the conclusions that 
an ecologist reaches about biological diversity in 
a given locality or assemblage will be shaped by 
the time frame over which data are collected. Pre¬ 
ston suggested that there are three main temporal 
scales: sampling time, ecological time, and evolu¬ 
tionary time. In his view sampling time encom¬ 
passes the fluctuations that occur in species pres¬ 
ence and species abundance, ecological time takes 
the form of succession, while evolutionary time cor¬ 
responds to speciation and extinction. Evolutionary 
timescales are implicit in phylogenetic (see Chap¬ 
ter 14) and palaeontological (see Chapter 19) inves¬ 
tigations, while models that seek to explain patterns 
of species abundance (see Chapters 9 and 10) draw 
on the features and interactions, such as resource 
limitation and competition, that characterize eco¬ 
logical time. 

Sampling time seems on first glance the least 
interesting of Preston's three categories, but in prac¬ 
tice it can have a substantial influence on the 
recorded level of biological diversity. There are two 
interrelated questions to be considered here: how 
well has the assemblage been sampled and how 
long has it been sampled for? As Bonar et al. (Chap¬ 
ter 2) and Buckland et al. (Chapter 4) reveal, it can 
be challenging to sample a community thoroughly. 
Species may not be equally detectable, they may 
vary in their diurnal activity levels and different 
life stages and sexes may have different behaviour 
patterns. 

However, even if it is possible to achieve a fair 
sample of the species that are present, it is not 
always easy to decide where the temporal bound¬ 
aries lie. It is possible, in theory at least, to take 
a perfect snapshot sample of a community. For 
instance, it would be feasible, given sufficient num¬ 
bers of trained assistants, to comprehensively assess 
the diversity of the herbaceous vegetation in a 
woodland or of birds on a moorland in a single 
day. This approach would, however, exclude plant 
species that flower at a different season or that occur 
in the seed bank. It also ignores bird species that 
breed on the moorland but are temporarily absent 
during migration, while including those migrants 
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flying through en route to their breeding site. There 
is a distinction then between an expressed commu¬ 
nity and the potential community. The expressed 
community is everything that is present as a given 
point of time and that can be sampled (assuming no 
sampling error). The potential community consists 
of the species that are part of a community when 
it is evaluated over a longer time period, but not 
necessarily there at a single point in time. These dis¬ 
tinctions matter if we are trying to deduce the rich¬ 
ness (or some other measure of diversity) of a com¬ 
munity since the species time curve increases with 
sampling in a similar manner to the species area 
curve (see section 7.4.1). For example the number 
of species in a well-documented estuarine fish com¬ 
munity (Henderson 2007) is approximately con¬ 
stant from year to year, but rises through time when 
samples are accumulated (see Fig. 8.5a). It is not 
clear if the richness of this community is 81 species 
(the total number detected over nearly 30 years of 
sampling) or 39 (the average number of species 
detected in any year). Decisions about temporarily 
absent species also come into play when species 
composition is considered. Thus assessments of (3 
diversity, which ask how different two or more 
localities are in terms of the actual species found 
there, will depend on which species are considered 
to belong to an assemblage, which in turn is linked 
to the time frame over which it is evaluated. These 
considerations are relevant to estimates of the num¬ 
ber of 'unseen' species (see Chapter 4). 

One way of dealing with community member¬ 
ship is to use biological knowledge to decide 
which species belong (Ulrich & Ollik 2004; Ulrich 
& Zalewski 2006; Magurran 2007). For exam¬ 
ple, investigations of bird diversity often exclude 
non-breeding or migratory species. Abbot (1983) 
argued that it is 'absurd' to include migrants 
in assessments of temporal turnover. Alterna¬ 
tively, investigators can focus on core species, that 
is those that occur regularly in the community 
(Magurran & Henderson 2003). Of course both 
approaches depend on good background informa¬ 
tion and may not be possible in poorly documented 
assemblages. 

Ecologists typically define a community as a 
group of interacting species in a delimited geo¬ 
graphical area. Researchers, however, often exam¬ 


ine communities that lie within narrow taxonomic 
boundaries, for example farmland bird species or 
cichlid fish. The reasons for doing this are usu¬ 
ally practical as it is difficult to sample organisms 
that differ markedly in body size (and would thus 
require a range of sampling methods), whose life 
histories occur at a different tempo (such as forest 
trees and annual herbs), or whose population sizes 
are not comparable (e.g. microbes and mammals) 
(see Loehle (2006) and Magurran (2008) for more 
discussion of these points). Nor is it necessarily 
* straightforward to make inferences about the bio¬ 
logical diversity of other taxa on the basis of data 
from one or a few groups. There has been an exten¬ 
sive debate on the effectiveness of different indica¬ 
tor taxa (e.g. Bibby 1999; McGeoch & Gaston 2002; 
McGeoch et al. 2002) and in many instances the con¬ 
clusion that emerges is that some, but by no means 
all, taxa show similar trends. In one case a study of 
14 different taxa in the Brazilian Amazon revealed 
some groups (notably birds and dung beetles) to be 
cost-effective and informative indicators, but also 
found that biodiversity patterns are not correlated 
across all groups (Gardner et al. 2007). 

Policy makers often consider biodiversity across 
all taxa and may use the term in a generic way with¬ 
out spelling out what is meant by it (EASAC 2009). 
However, the conclusions that are drawn about bio¬ 
diversity, such as whether it is being lost or whether 
conservation management plans are effective, will 
inevitably be shaped by the way the community 
is defined. This will include not just spatial and 
taxonomic boundaries, but also its temporal extent. 

7.3 What is being measured? 

Biodiversity assessment is a two-step process. The 
first step involves selecting the measure or mea¬ 
sures of diversity that will be used. As noted 
throughout this book, a range of approaches is pos¬ 
sible. 'Classical' diversity measures tend to quantify 
the total (or mean) number of entities (richness) 
in the population or to estimate the variability in 
the relative abundance of these entities (evenness) 
(see Box 7.1). In addition there are many measures 
that blend richness and evenness. The well-known 
Simpson and Shannon indexes are two examples. 
As Box 7.1 illustrates, the same approach can be 
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Box 7.1 Biodiversity assessment 


Biodiversity assessment is a two-stage process. The first 
stage is deciding which diversity measure to use, the 
second is examining that measure of diversity in a spatial 
and/or temporal context. For example, one might describe 
the diversity of a sampling locality (depicted as a small 
shaded cube in the diagram) using the measure of choice 
and then go on to ask how the diversity of a site changes 
through a slice of time (represented as the larger shaded 
section). In both cases it is essential to be clear about the 
aim of the investigation, and, where relevant, the 
hypothesis that is being tested. 


Richness 


Landscape 


Evenness 



Choosing a measure 

In this example there are three 'categories' of 
interest—genes, species, and traits—each of which 
captures one aspect of biodiversity. Richness records the 
number of different types of entity while evenness 
evaluates the extent to which they are equally represented. 
They lie at either end of a continuum populated by the 
many heterogeneity measures that blend richness and 
evenness. Although investigators generally concentrate on 
one category of diversity, they can all be measured in 
equivalent ways. For example, if the focus is on species, 
biodiversity can be represented as species richness, or by a 
diversity measure such as the Shannon or Simpson index, or 
by an evenness index. Alternatively, if genes are the 
category of interest, diversity might be expressed as allelic 
richness or heterozygosity. Heterozygosity is statistically 


equivalent to Simpson's index. Traits can be treated in the 
same way. However, while all of these approaches measure 
biodiversity, the units in which this diversity is expressed, 
and the insights gained, can differ. Chapters 4 to 10 focus 
on species diversity measures while Chapters 13 to 16 deal 
with the alternative currencies of traits, phylogenies, and 
genes. 


Assessing diversity over time (and space) 

Investigators often want to go beyond a snapshot measure 
of diversity at a single site. There are a number of different 
ways in which temporal diversity can be examined. One 
approach is to ask how rapidly diversity is accumulated 
when the time period over which data are collected is 
extended. Species-time curves (see section 7.4.1) are the 
best-known example of this, although of course 
diversity-time curves could be constructed using different 
measures and these might provide novel insights into 
community dynamics. A different set of questions asks how 
much the site (or community) has changed between certain 
points in time. Turnover indexes (section 7.4.2) and species 
abundance distributions (section 7.4.3) can be used for this 
purpose. Alternatively, the goal might be to detect a trend 
in diversity and this can be achieved by plotting the index of 
choice against time (section 7.4). Once again this is 
commonly done using conventional diversity statistics (such 
as species richness or the Shannon index), but there is no 
reason why measures that take account of traits or genes 
could not be used (and might indeed be highly informative). 
A related, but distinct, question is whether there is a 
change in this trend (section 7.5). 

Other sets of questions might be posed about this 
landscape. It might, for example, be interesting to know if 
spatial turnover (classical j8 diversity, Chapter 6) has 
changed through time, or whether spatial or temporal 
turnover is greater, or what the structure of the species 
abundance distribution is when temporal data are 
accumulated (section 7.6). The key point is that biodiversity 
assessment means deciding not just which measure to use, 
but also being precise about the question that it is being 
used to answer. 


used for different categories of measure, different conclusions about biodiversity, as can 
such as genes, species, or traits. However, statistically analogous measures in different 
different measures within a category can yield categories. 
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The next step is to make inferences about diver¬ 
sity in the wider context. There is a large litera¬ 
ture on the virtues and vices of competing diversity 
statistics but much less attention has been directed 
to the challenges of drawing robust conclusions 
about diversity at the landscape scale or across 
varying time periods from a series of samples. 
These broader questions can take different forms. 
For example, one might wish to know how diver¬ 
sity changes through time, for instance in relation to 
climate change, or along a spatial gradient, such as 
one related to altitude or disturbance. Alternatively, 
the goal might be to examine the difference in com¬ 
position in time or space and to evaluate turnover 
across landscape scales (see Box 7.1). 

7.4 Assessing change through time 

7.4.1 Temporal turnover: species time curves 

Ecologists have long recognized that species rich¬ 
ness increases with sampling duration (Grinnell 
1922; Fisher et al. 1943; Preston 1960; Williams 
1964). The idea that a species time curve can be 
used to assess temporal turnover in the same way 
as species area curves are used to gauge spatial 
turnover is more recent but is gaining ground 
(Rosenzweig 1998; White et al. 2004; White 2007; 
van der Gast et al. 2008). White et al. (2006) exam¬ 
ined species time curves from nearly 1000 commu¬ 
nity time series representing a range of taxa. Their 
analysis demonstrated the ubiquity of species time 
curves, showed that they are usually well described 
by both power and logarithmic functions, and 
found that when power curves are used the expo¬ 
nent typically ranges between 0.2 and 0.4. Interest¬ 
ingly, lower rates of turnover (denoted by a lower 
value of the exponent) are associated with higher 
richness. Shurin (2007) puts forward a number of 
explanations for this observation. These insights 
underline the need to take account of timescale 
in both fundamental and applied work. However 
Preston's ideas of equivalence in species time and 
species area curves also have practical implications 
for those who wish to assess diversity. 

Using longterm data from grassland commu¬ 
nities Adler & Lauenroth (2003) found, as Pre¬ 
ston predicted, that new species are added to the 


species list more slowly when larger areas are sur¬ 
veyed. Indeed Adler et al. (2005) argued for a 
general species-time-area relationship using data 
from eight independent assemblages. Their analy¬ 
sis indicated that species richness is linked to both 
sampling duration and sampling area. They also 
found a negative time-area interaction. This means 
not only that the slope of the species-time curve 
is lower when larger areas are sampled but also 
that the slope of the species-area curve is lower 
.when sampling occurs over longer periods. It is 
thus essential to clearly specify both the area and 
time period of sampling in an estimate of species 
richness (Adler & Lauenroth 2003). It also means 
that, as with estimates of species richness in relation 
to area (see also Chapters 4 and 20), it is wrong to 
calculate the number of species present in a shorter 
(or longer) time period than that over which the 
data have been collected by simple division (or mul¬ 
tiplication). Indeed White et al. (2006) note that, on 
average, the number of species detected in a lOyear 
period is in the region of twice that recorded in a 
single year. 

7.4.2 Temporal turnover: turnover indexes 

An alternative approach is to use an index to cap¬ 
ture temporal turnover in species composition. The 
percentage of species shared between successive 
time periods is one example (Lekve et al. 2002), the 
ratio of mean species richness to potential species 
richness another (Romanuk & Kolasa 2001). 

Temporal turnover ( t ) measures often draw on 
MacArthur & Wilson's (1967) ideas on the dynamic 
interplay between immigration and local extinc¬ 
tion. Brown and Kodric-Brown (1977), for example, 
proposed: 

b + c 

Si + S2 

where b is the number of species present in the first 
census only, c is the number of species present in the 
second census only. Si is the total number of species 
present in the first census, and S 2 is the total number 
of species present in the second census. However, as 
Diamond and May (1977) noted, the census interval 
affects the estimate of turnover. They proposed an 
index to take account of this: 
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where l is the number of species lost (locally 
extinct), g is the number of species gained (immi¬ 
gration), S is the total number of species present, 
and ci is the census interval. 

Another common approach is to use similarity 
(or dissimilarity) measures. For instance, Zamora 
et al. (2007) used complementarity (a type of dis¬ 
similarity measure (Colwell & Coddington 1994)) 
when assessing temporal shifts in the biodiversity 
of Mediterranean agroecosystems, while Chytry 
et al. (2009) selected the Jaccard index to examine 
turnover in a successional heathland in the Czech 
Republic, as did Sepkoski (1988) in his analysis 
of the diversity of marine macrofossils during the 
Palaeozoic. Many (dis)similarity (and complemen¬ 
tarity) measures—such as the Jaccard index—use 
presence/absence data. Measures that take account 
of species abundances are also available. The Soren¬ 
son quantitative (Bray-Curtis) and Morisita-Horn 
measures are examples of these. Turnover measures 
are reviewed by Jost et al. (Chapter 6). Estimates 
(viceroy.eeb.uconn.edu/estimates) can be used to 
calculate some of the more useful turnover mea¬ 
sures and as well as a range of other biodiversity 
statistics. 

Similarity measures (and their relatives) have 
usually been devised to make comparisons between 
geographical areas. There are few problems asso¬ 
ciated with their use when assessing temporal 
turnover, but one exception is the issue of the cen¬ 
sus interval. Of course if a study examines turnover 
over identical time periods this is probably not 
a difficulty (although there are conceivably con¬ 
cerns related to differences in detectability (see 
Chapter 3)). 

Probably the simplest approach is to plot 
turnover in relation to census interval, as Rus¬ 
sell et al. (1995) did in their investigation of tem¬ 
poral turnover in bird communities on islands 
around Britain and Ireland. Another example of 
this is shown in Fig. 7.1. As long as there is rea¬ 
sonable sample coverage it is possible to exam¬ 
ine the relationship between turnover and cen¬ 
sus length. Given the dynamic na^yre of ecologi¬ 
cal communities an increased gap between samples 
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Figure 7.1 This graph shows temporal turnover (measured as 1 minus 
the Morisita-Horn index) plotted against the census interval for an 
assemblage of estuarine fish from the UK's Bristol Channel, censused for 
27 years. Further details of the study system are given in Chapter 8. 
Although there is a lot of scatter (this is typical) the trend is for increasing 
turnover with time. 

will normally be reflected in greater turnover. Some 
localities or communities may nonetheless exhibit 
higher than expected turnover and this could be 
related to ecological factors (such as distance from 
the source of immigrants) or be of management 
interest (for instance unexpectedly high rates of 
turnover in a nature reserve). 

Plotting values of temporal turnover against cen¬ 
sus interval is an easy way to visualize change 
and can be used to make comparisons amongst 
sites (for example by comparing the slopes of lines 
fitted to the relationship). Alternatively, it is pos¬ 
sible to work from an a priori prediction about 
the expected pattern of temporal turnover. Russell 
et al. (1995), for example, developed an equilib¬ 
rium model (sensu MacArthur and Wilson 1967) 
to explain temporal turnover in their island bird 
communities. Although this model provided a rea¬ 
sonable fit to the data, it was not entirely satisfac¬ 
tory. Russell and his colleagues then devised an 
improved non-equilibrium model and used this to 
compare turnover amongst islands. 

7.4.3 Using species abundance distributions 
to evaluate change 

Species abundance distributions (SADs) are an 
appealing way of examining biological diversity as 
they provide an overview of community structure 
and are potentially more informative than a single 
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index (McGill et al. 2007 and see Chapter 9). What 
SADs do not do, by and large, is to provide infor¬ 
mation on species composition. This is generally an 
advantage as it means that communities with little 
or no taxonomic overlap can be compared. It can, 
however, be a drawback if the goal is to ascertain 
how the same community changes through time 
since the investigator may wish to know if a shift 
in the abundance distribution is accompanied by a 
change in composition. 

Investigators often want to find out if SADs dif¬ 
fer from one another. A straightforward method 
of doing this is to examine the slope of a rank 
abundance distribution (Tokeshi 1993; Magurran 
2004). For instance, the rank abundance distribu¬ 
tion of species present in heavily fertilized plots 
in the Park Grass experiment at Rothamsted, in 
southern England (Lawes et al. 1882), has become 
much steeper through time in response to increased 
dominance by one or two species and loss of 
species richness (Tokeshi 1993; Magurran 2004). The 
Q statistic (Kempton & Taylor 1978; Kempton & 
Wedderburn 1978)—the interquartile slope, where 
cumulative number of species is plotted against 
log abundance—can also be used to make com¬ 
parisons between assemblages and is less influ¬ 
enced by changes in species richness or the pres¬ 
ence of a handful of extremely abundant or very 
rare species. Another approach is to use the obser¬ 
vation that a particular model fits an assemblage 
to make inferences about its structure. As McGill 
points out in Chapter 9 (and see also Magurran 
(2004) and McGill et al. (2007)), a range of models 
have been proposed to describe and explain SADs. 
For example, communities at the early stages of 
succession are often well described by a geometric 
(Motomura 1932) or log series (Fisher et al. 1943) 
model, while a log-normal (Preston 1948) model is 
usually a better descriptor of a mature community 
(May 1975; Magurran 2004). Connolly and Dornelas 
(Chapter 10) review the techniques developed for 
fitting and evaluating SADs but also remind us 
that care needs to be taken when drawing ecolog¬ 
ical conclusions from the observation that a certain 
model provides a good fit. 

The above techniques for examining and compar¬ 
ing SADs are well established but may not provide 
the resolution needed to distinguish temporal tran¬ 


sitions in community structure. Some recent devel¬ 
opments that appear to hold promise for temporal 
analyses include the following: 

i It is becoming clear that SADs vary in the extent 
to which they are multi-modal (Dornelas & Con¬ 
nolly *12008; Dornelas et al. 2009) so tracking 
changes in modality through time could be infor¬ 
mative. 

ii The empirical cumulative distribution function 
(see Dornelas et al. (Chapter 17) for a discus¬ 
sion of this approach) appears to be a useful 
method of assessing the effects of anthropogenic 
disturbance on communities and may have gen¬ 
eral application in identifying changes in SADs 
through time (and space). 

iii Although classical SADs do not consider species 
identity there is increasing recognition that it can 
be revealing to do so, and methods that track 
labelled species using a SAD framework are 
being developed (MacNally (2007) and Chap¬ 
ter 8). One approach that appears to be particu¬ 
larly promising is the use of rank clocks and the 
development of statistics that track ranks, such 
as mean rank shift (Collins et al. 2008). 

iv Methods of comparing community composi¬ 
tion through time, for example by using the 
Euclidean distance to measure the difference 
in composition and structure between pairs of 
assemblages, are being developed (Collins et al. 
2000; Thibault et al. 2004). 

v There are some interesting new approaches to 
modelling and assessing resilience in ecologi¬ 
cal communities. These may ultimately improve 
our understanding of the manner in which 
communities change through time and pro¬ 
vide improved methods of assessing this change 
(Thrush et al. 2009). 

7.4.4 Assessing change using biodiversity 
indexes 

There is another simple way of establishing if diver¬ 
sity changes through time and whether this change 
has a trend, and that it to select a diversity measure 
(such as the Shannon or Simpson index) and plot it 
against time (see, for example. Fig. 7.2a). Buckland 
et al. (2005) provide guidance on tests that can be 
used to assess the statistical significance of changes 
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Figure 7.2 A straightforward way of exploring changes in an ecological 
community is to plot the value of a diversity measure against time. This 
example uses data from the Bristol channel estuarine fish assemblage (as 
in Figure 7.1) and shows (a) Fisher's a (Fisher et al. 1943; Magurran 
2004), and (b) taxonomic distinctness (Clarke & Warwick (1998) and see 
also Chapter 14). In this example diversity fluctuates by a small amount 
from year to year, but does not show an increasing or decreasing pattern 
through time. 

and explain how to extend this approach to exam¬ 
ine trends in biological diversity at a regional level. 

This method has the merit of being straightfor¬ 
ward and intuitive. It will be subject to a decision 
about which diversity index is most appropriate 
(see Magurran (2004) and Chapters 5 and 6 for dis¬ 
cussion of some options), whether or not species 
vary in their detectabilities and if this changes 
through time (see Chapter 3), whether or not sam¬ 
pling protocols are appropriate (see Chapter 2 and 
Magurran et al. 2010), and so on. Similar consider¬ 
ations apply to the other approaches described in 
this section. If the goal is to ask if there is a trend 
in biological diversity other issues come into play, 
including how the baseline is decided. Some meth¬ 
ods, such as the Living Planet Index (Loh et al. 2005) 
choose an arbitrary year, set diversity at unity, and 
measure change in relation to that. In the case of 
the Living Planet Index trends are calculated across 
each species and then combined. An arbitrary base¬ 
line can work well in practice but is ecologically 
unrealistic as it ignores the inherent variability of 
natural assemblages. Moreover, the choice of base¬ 


line can have dramatic influences on the conclu¬ 
sions drawn. For example, it can appear that fish 
stocks have made a substantial recovery if present- 
day populations are compared with those in the 
recent past. However, examination of a longer time 
series can show that present stocks are extremely 
low relative to their historical levels (Hutchings 
2000 ). 

The indexes discussed thus far in this chapter 
are conventional ones in the sense that their cur¬ 
rency is species. There is, however, no reason why 
other approaches could not be used. For exam¬ 
ple, one might plot changes in taxonomic distinct¬ 
ness in relation to time (see Fig. 7.2b). Recently, 
Pavoine et al. (2009) have developed methods 
for assessing phylo-|3-diversity and used these to 
reveal substantial declines in the diversity of a 
heavily fished group of rockfishes. This method¬ 
ology draws on information about phylogenetic 
and taxonomic diversity (see also Chapter 14) and 
can be used to evaluate both spatial and tem¬ 
poral diversity patterns. Indexes based on traits 
and functional diversity (see Chapter 13) are other 
options. 


7.5 Measuring change in the rate 
of change 

The preceding discussion has focused on the meth¬ 
ods used to determine whether or not the biological 
diversity of a community changes over time. Mea¬ 
suring change in the rate of change is an allied but 
distinct issue. Researchers and managers increas¬ 
ingly wish to know if changes in a community or 
system are slowing or increasing. The 2010 target, 
which set out to reduce the rate of biodiversity loss, 
is an example of this type of question. The slope of 
a plot of a diversity index against time (as described 
in section 7.4.4) represents the rate of change in bio¬ 
diversity (Magurran et al. 2010). The second deriva¬ 
tive of the trend line reveals the change in slope and 
hence measures the rate of change. For example, in 
a case where biodiversity was being lost, a posi¬ 
tive second derivative would indicate an improved 
situation (that is, a reduction in the rate of loss) 
whereas a negative second derivative would sug¬ 
gest a worsening scenario. Fewster et al. (2000) and 
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Buckland et al. (2005) provide methods and explain 
how uncertainty measures can be calculated. 

7.6 Using temporal change to shed 
light on community structure 

Rather than assessing change through time, an 
investigator may wish to draw on information 
about the dynamic nature of a community to gain 
a deeper understanding of its structure. One of the 
central questions in ecology is why some species 
are abundant while many are rare (discussed fur¬ 
ther in Chapters 8 and 9). Temporal data can reveal 
how the relative abundance of core and occasional 
species affects the abundance distribution (Magur- 
ran & Henderson 2003; Loehle 2006; Magurran 
2007). However, as McGill (2003a) cautions, exam¬ 
ining a community's structure by the accumulation 
of many small samples collected through time (or 
space) can lead to artefacts, for example by giving 
undue prominence to rare species. Large data sets 
based on repeated temporal and/or spatial sam¬ 
pling have been widely used to test a variety of 
SAD-related hypotheses such as Preston's (1948, 
1960) veil line. Nonetheless, such data sets can blur 
the community boundaries (as described in sec¬ 
tion 7.2 above) and make it difficult to link pattern 
and process. A more informative approach may be 
to ask when—and how—a particular SAD is main¬ 
tained in an assemblage through time (e.g. Thibault 
et al. 2004). 

7.7 Partitioning diversity in space 
and time 

From the preceding discussion it is clear that biolog¬ 
ical diversity varies through both space and time. It 
is possible to tease the spatial and temporal com¬ 
ponents apart to see where most of this diversity 
is located. Moreno and Halffter (2001), for example, 
attributed species richness in a tropical bat commu¬ 
nity to both a diversity (within habitat diversity) 
and temporal turnover. Whittaker (1960) was first 
to appreciate that spatial diversity on a landscape 
(or y diversity) scale can be partitioned into a and 
|3 components. Partitioning methods are receiving 
growing attention (e.g. Lande 1996; Veech et al. 
2002; Crist et al. 2003; Crist & Veech 2006; Jost 


2007; see Chapter 6 for a fuller discussion of the 
issues involved). Until recently the focus has been 
on spatial partitions but investigators are increas¬ 
ingly considering both space and time in the same 
analysis (e.g. Tylianakis et al. 2005; Sobek et al. 2009; 
Tofnasovych & Kidwell 2009). 

7.8 Prospectus 

Better methods of assessing temporal patterns and 
shifts in biological diversity are urgently needed 
and I believe that there will be much greater empha¬ 
sis on these topics in the next few years. There 
are many aspects that deserve more attention but 
I will briefly mention four that seem important 
and tractable. First, it would be useful to extend 
partitioning methods to provide better tools for 
determining how biological diversity is distrib¬ 
uted in both time and space. For example, how 
much rarity (or commonness) is explained by the 
nested effects of seasonal, annual, and other cyclic 
events? Second, many species abundance models 
are essentially static and thus unrealistic, but could 
be modified to take account of temporal variation. 
Third, we still know relatively little about how 
communities respond to and recover from impacts 
(McGill 2006). There is growing interest in resis¬ 
tance and resilience (Thrush et al. 2009), particularly 
in the context of ecosystem function (Loreau 2010) 
and this will, I hope, stimulate the development of 
better methods of assessing change in communi¬ 
ties and measuring their response to disturbance. 
Finally, new insights will be gained by moving 
beyond a single taxon to examine changes in the 
biological diversity of different groups of organisms 
in a community through time. 

7.9 Key points 

1. As has been appreciated for many decades, eco¬ 
logical communities change through time as 
well as space. There are two types of change: 
directional change, when a community moves 
from one state to another (e.g. succession), and 
non-directional change (the product of ongoing 
turnover). 

2. As Preston observed, change occurs over sam¬ 
pling, ecological, and evolutionary timescales. 
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3. Investigators may wish to assess the extent of 
change (temporal turnover), measure change in 
the rate of change, or draw on temporal infor¬ 
mation to gain a deeper understanding of com¬ 
munity structure. 

4. Temporal turnover can be measured by species¬ 
time curves, by turnover indexes, by examining 
changes in species abundance distributions, and 
by plotting a diversity measure against time. 


5. Census interval can have a marked influence on 
the conclusions drawn and needs to be taken 
into account in analyses. 

6. There is considerable scope for the development 
of new methods and approaches to measure and 
explain temporal changes in biological diversity. 

7. Policy makers need to appreciate that change is 
natural and inevitable, and to take account of it 
in conservation planning. 
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CHAPTER 8 


Commonness and rarity 

Anne E. Magurran and Peter A. Henderson 


8.1 Introduction 

Ecological communities are composed of species, 
some of which will be abundant while the 
remainder—almost invariably the majority—are 
rare. This pattern is so pervasive that it has been 
dubbed a 'law' of ecology (McGill et al. 2007). To 
date there is only a single exception to this rule. 
A bacterium, Candidatus Desulforudis audaxviator, is 
the sole organism inhabiting a goldmine 2.8 km 
below the earth's surface (Chivian et al. 2008). 
Although an uneven distribution of species abun¬ 
dances is near universal, the shape of this distribu¬ 
tion can vary amongst assemblages. This can be due 
to the intrinsic features of a community—tropical 
systems, for example, tend to havt high numbers 
of rare species. However, it may also be influenced 
by the time frame or spatial extent over which data 
are collected, as well as sampling methodology and 
sampling intensity. Small differences in the shape 
of species abundance distributions are sometimes 
used to make inferences about the mechanisms that 
structure a community, for instance to deduce the 
extent of niche apportionment or to evaluate the 
impact of disturbance, for example during pollu¬ 
tion monitoring (Chapter 17). These assessments 
will be sensitive to shifts in the proportion of rare 
species or the balance between commonness and 
rarity. Furthermore, species may be valued differ¬ 
ently depending on their position in the abundance 
continuum. Much of the functionality in communi¬ 
ties, such as the delivery of ecosystem services, may 
reside in the common species, whereas rare species 
are often the target of conservation efforts. 

Patterns of commonness and rarity raise funda¬ 
mental ecological questions, but these are issues 
that are also playing increasingly important roles 
in management decisions. For example, relative/ 


absolute abundance may be used as a criterion 
for enacting certain conservation measures (Kunin 
1998). Similarly, changes in species abundance dis¬ 
tributions may be used a guide for identifying how 
much human activity has influenced an ecological 
community (see Chapter 17 for further discussion 
of these points). 

In this chapter we briefly review the different 
concepts of commonness and rarity, show how the 
perception of what is rare or common can change 
with the currency used to measure abundance 
or the time frame over which diversity is mea¬ 
sured, and summarize implications for biodiversity 
assessment. 

8.2 State of the field 

Darwin (1859) was well aware that taxa vary in 
their abundances and commented in The Origin of 
Species that 'rarity is the attribute of a vast number 
of species of all classes, in all countries'. Contem¬ 
porary Victorian naturalists, including Wallace and 
Bates, made similar observations in their writing. 
However, it was Preston (1948) who, following in 
the footsteps of Raunkaier (1934) and Fisher et al. 
(1943), sought a general explanation for common¬ 
ness and its distribution among species. Common¬ 
ness and rarity are terms that are in widespread use 
but which can take different meanings depending 
on context. As Preston (1948) noted, the term abun¬ 
dance is used in several different ways by ecolo¬ 
gists. These include: 

1. The total number of individuals of an extant 
species— global abundance. 

2. The total number of individuals present, at one 
moment in time, in a given area (such as a 
quadrat) or locality— local abundance. 
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3. The proportion of individuals in one species in 
relation to the abundance of all species in the 
community —relative abundance. 

4. The number of individuals recorded or 
collected —sample abundance. 

Ecologists typically focus on sample abundance, 
and often express their data as relative abundance 
since this facilitates comparisons amongst data sets. 
Only occasionally do we know the global abun¬ 
dance of a taxon, and often this is because the 
species has dwindled to a handful of individuals. 
Local abundance is more tractable, but because it 
is usually impossible to census every individual in 
an area, local abundance is usually inferred from 
sample abundance. 

Our focus here is abundance in the sense in which 
it is used to construct species abundance distri¬ 
butions or to calculate diversity statistics. There 
are, of course, other ways in which species can 
be defined as common or rare. Rabinowitz and 
her colleagues (Rabinowitz 1981; Rabinowitz et al. 
1986), for example, proposed that a species' rar¬ 
ity status depends on its geographical distribution, 
its habitat specificity, and its local population size. 
Only species with wide geographical distributions, 
broad habitat tolerance, and large local population 
sizes are designated as common according to this 
scheme. All other taxa are rare in one of seven 
ways. One category of rarity—narrow geographic 


distribution, broad habitat specificity, and small 
local population size—is only occasionally seen 
(Thomas & Mallorie 1985; Rabinowitz et al. 1986; 
Magurran 2004). 

Species occupancy (see Chapter 11), that is, the 
number of grid cells or localities that a taxon occurs 
in, is another approach (Gaston 1994). Occupancy 
tends to correlate with local species abundance 
(Gaston & Lawton 1990; Gaston & Blackburn 2000). 
However, the use of occupancy to deduce species 
rarity can be biased if occupancy has been mea¬ 
sured on a coarse scale (He & Condit 2007). 

8.3 Commonness and rarity: 
ecological context 

Species richness is the iconic diversity measure. 
It is widely used, for example in the designation 
of biodiversity hotspots, and easy to comprehend, 
although as is made clear in Chapter 4, accurate 
estimation of the extent of richness can be chal¬ 
lenging. Since species richness measures weight 
all species equally they are in essence a measure 
of the number of rare species. This emphasis 
makes it easy to overlook the fact that a handful 
of species contribute most of the abundance, 
even in highly diverse assemblages (e.g. Fig, 8.1). 
Although some rare species may have a critical 
role in a community, for example as pollinators, 
it is increasingly recognized that common species 
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Figure 8.1 A small number of species account for most of the abundance in an estuarine fish assemblage in the Bristol Channel, UK. This community has 
been sampled monthly for almost 30 years (Hfipderson & Holmes 1991; Henderson 2007). A total of 81 species have been recorded to date. The graph 
shows the cumulative relative abundance of species in the assemblage, ranked from most to least abundant. 
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Figure 8.2 Rank abundance or Whittaker plots are a popular way of presenting species abundance data. This graph shows that the shape of this 
relationship depends on the intensity of sampling, the currency used to measure abundance, and whether the data are presented in relative or absolute 
form. The data are from an estuarine fish community and were collected in the Bristol Channel, UK, during 167 h of intensive sampling in 1996, by PAH 
using previously described methods (Henderson & Holmes 1991; Henderson 2007). The three lines on each graph represent the species collected in a 1-h 
sampling period (♦), a 44-h sampling period (■), and a 167-h (a) sampling period. The number of species sampled were 12,25 and 47, respectively. 
Panels (a) and (b) depict the rank abundance plots for relative abundance data when abundance is measured as either biomass (wet weight of fish) or as 
numerical abundance (number of individuals). Panels (c) and (d) show the equivalent graphs for absolute abundance. Rarity is comparable across the 
different sampling levels when absolute abundance is used (a single individual or its biomass equivalent is the smallest unit possible). A discrete group of 
very rare species (singletons and doubletons) are evident with numerical abundance but not with biomass. 


contribute significantly to ecosystem function 
(Smith & Knapp 2003; Gaston & Fuller 2008). 
Moreover, niche partitioning models typically 
focus on the most abundant species and ignore the 
rare ones (Tokeshi 1993) on the assumption that 
the biological processes that shape abundances act 
most strongly on the common species. Common 
species are important from a human perspective as 
well given that they may provide natural harvests 
or represent damaging pests. While rare species are 
particularly vulnerable to extinction, commonness 
does not necessarily ensure survival. The passenger 
pigeon Ectopistes migratorius is a textbook example 
of a once abundant species that is now extinct. 
There are many other cases of previously common 
species, such as the bison. Bison bison, and Peruvian 


anchoveta, Engraulis ringens, that have undergone 
dramatic population declines. 

Most presentations of biodiversity data, such 
as rank abundance plots ('Whittaker' plots) or 
frequency distributions such as 'Preston plots', are 
drawn to depict relative abundances (e.g. Fig. 8.2). 
In these the abundance of the most abundant 
species will inevitably decline as the sample size 
(or assemblage size) increases (May 1975). For 
example, the relative abundance of the most 
abundant fish in the community depicted in Fig. 8.2 
falls from 82% when the community has been 
sampled for 1 h to 73% when it has been sampled 
for 44 h, and to 67% when it has been sampled for 
167h. These data refer to numerical abundance. The 
equivalent figures for biomass are 67,54, and 46%. 
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Relative abundance plots facilitate the investi¬ 
gation of community structure across assemblages 
that vary in size and species composition (McGill 
et al. 2007). However, the untransformed abun¬ 
dance of the same species in nature may show a 
different pattern. For example, the absolute abun¬ 
dance of the most abundant species in Trinidadian 
freshwater fish communities is independent of com¬ 
munity size, whereas its proportional abundance 
decreases with increasing species richness (Dor- 
nelas et al. in press). This observation has impli¬ 
cations for niche apportionment and species pack¬ 
ing. It suggests that there is an upper limit in 
abundance, in numerical terms at least, of species 
in a community with given environmental char¬ 
acteristics (Lawton 1990). For instance, in Trinida¬ 
dian streams guppies (Poecilia reticulata) are often 
the numerically dominant species but it seems that 
resource availability, which will include food and 
physical space, sets a cap on overall abundance 
(Magurran 2005). 

We still know relatively little about how the 
abundances of common species vary amongst com¬ 
parable communities along a gradient, such as a 
climatic one, but it would be interesting to ask 
whether any differences that are observed are 
greater than those that would be expected by 
chance. 

If some species are common, many more are rare. 
The problem, however, is that ecological surveys 
can give the impression that extreme rarity is wide¬ 
spread. Coddington et al. (2009) analysed 71 of 
largest and most ambitious tropical arthropod sur¬ 
veys and found that an average of 32% of species 
were singletons (Fig. 8.3). The notion that around 
a third of the species in an assemblage are repre¬ 
sented by a single individual does not make biolog¬ 
ical sense—indeed it is likely that most, if not all, 
are members of viable, breeding populations. Vari¬ 
ous explanations have been put forward to explain 
the preponderance of singletons, particularly those 
found in rich tropical assemblages. These include 
habitat specificity (e.g. Price et al. 1995), the pres¬ 
ence of vagrant species (e.g. Southwood 1996), the 
presence of immature forms that cannot be identi¬ 
fied (e.g. Longino et al. 2002), and sampling effects, 
such as the occasional appearance of individuals in 
traps designed to capture different types of organ- 
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Figure 8.3 (a) Frequency distribution of the percentage of singletons 
observed in surveys of tropical arthropods, (b) Number of singleton species 
in relation to species richness. Data from Coddington et al. (2009). 

ism (e.g. Scharff et al. 2003). Coddington et al. 
(2009) argue that biological hypotheses, specifically 
small body size, cryptic habitats, male-biased sex 
ratio, and clumped distributions, cannot explain the 
observed frequency of singletons. Instead, under¬ 
sampling bias is the most likely cause. They suggest 
that the true singleton frequency is probably in the 
order of 4%. A related problem is that species may 
vary in their detectability (see Chapter 3) such that 
some taxa appear to be rarer (or more common) 
than they actually are in nature. 

Many communities hold a small number of liv¬ 
ing fossils; coelacanths (Latimeria chalumnae) and 
yew trees (Taxus baccatus) are two well-known 
examples. These species, representatives of once 
dominant taxonomic groups, are almost always 
rare in abundance and/or highly localized in their 
distribution. Their rare status may make it more dif- 
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ficult for specialist pathogens, parasites, predators, 
or herbivores to evolve or maintain populations 
(in the same way that Apple computers receive 
fewer attacks from viruses). For example, yew 
is an ancient woodland tree with few associated 
herbivores. Kennedy and Southwood (1984) noted 
that two British oak ( Quercus ) species support 
423 insect species. The equivalent figure for yew 
is six. Additionally there may be adaptations to 
enable populations to persist at low densities. 
One key adaptation of plants such as yew is 
extreme longevity. Thus, although the species may 
be rare at any one point in time, mature indi¬ 
viduals are exceedingly persistent through time. 
Other instances where species appear to have 
adaptations that enable them to persist despite 
being rare include some freshwater fish (Magurran 
2009). For example, geographically isolated white- 
fish species—the pollan, powan, and vendace ( Core - 
gonus spp.)—have survived in the British Isles since 
the last ice age while several members of the Good- 
eid family, for example Characodon audax and Allo- 
toca catarinae, have naturally restricted distributions 
in Mexico. 

An investigator's perception of which species 
are rare (or common) can depend on which sites 
are investigated, the timing and duration of sam¬ 
pling, and the currency used to measure abun¬ 
dance. Species that are rare in one locality may 
be more abundant elsewhere. Murray et al. (1999) 
asked whether species of trees that are in the tail of 
a rank abundance curve are rare throughout their 
geographical range. They found that 91% of species 
from dry sclerophyll woodland and 95% of species 
in temperate woodland were abundant in other 
places. Similarly, some species that are typically 
of low abundance, or apparently absent from the 
assemblage altogether, are exceptionally abundant 
at certain times. Cicadas are a classic example of 
such an organism (Yang 2004). 

The manner in which abundance is measured 
can also affect the perception of rarity. Numeri¬ 
cal abundance (number of individuals) and bio¬ 
mass are two common metrics. Energy use, which 
is often inferred rather than measured directly, is 
another. It is increasingly clear that different curren¬ 
cies lead to different conclusions about the structure 
of the species abundance distribution, and different 
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Figure 8.4 Species may have different rankings in alternative 
currencies. This figure illustrates rank abundance plots for numerical 
abundance (•) and biomass (□) during an intensive survey of an estuarine 
fish community (as in Fig. 8.2). Species ranks are based on numerical 
abundance. 

rankings of species in terms of relative abundance 
(e.g. Saint-Germain et al. (2007), Morion et al. 
(2009), and Henderson & Magurran (2010), and see 
Figs 8.2 and 8.4). 

The high level of resources that biodiversity sur¬ 
veys demand (Lawton et al. 1998; Gardner et al. 
2007) means that snapshot samples are common. 
Sometimes, however, longer surveys are possible 
and it is now well appreciated that biological 
diversity increases with sampling duration (Preston 
1960; Rosenzweig 1995; Thibault et al. 2004; White 
et al. 2006; Magurran 2007; White 2007; Magurran 
2008; see Chapter 7) in an analogous manner to the 
well-known species area curve (e.g. Fig. 8.5). The 
increment in species number is to a large extent 
due to the occasional presence of low-abundance 
species. In some cases this will be because the num¬ 
bers of a particular species are so low that it is 
unlikely to be sampled. In other cases these rare 
species reflect temporal turnover in species (immi¬ 
gration and local extinction) that all communities 
experience (MacArthur & Wilson 1967). Repeated 
sampling at a site is known to inflate the number 
of rare species (McGill 2003a; Magurran 2007). With 
the exception of some well-characterized assem¬ 
blages (e.g. Magurran & Henderson 2003) it is not 
always easy to identify which rare species belong to 
an assemblage and which are just 'passing through'. 
Further research is needed to resolve this important 
question. 
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Figure 8.5 (a) Although the cumulative number of species (—) in an estuarine fish assemblage (Henderson & Holmes 1991; Henderson 2007) increases 
through time, annual species richness (—) remains constant. The rise in total recorded species is due to occasional species, which appear only sporadically 
in the record and are rare in terms of their abundance, (b) The percentage of overall abundance (number of individuals) in each year of the series that is 
attributed to species present in all years. 


8.4 Assessing commonness and rarity 

Identifying common and rare species is a fre¬ 
quent requirement, although not necessarily one 
that is easy to accomplish. Ecologists, for example, 
may wish to separate common and rare species 
when evaluating function. Decisions about the 
boundaries between commonness and rarity (or fre¬ 
quent and infrequent species) are also required for 
some of the species richness estimators (e.g. Colwell 
2009; Chapter 4). Furthermore, a changing pattern 
of rarity and commonness (or dominance) can be 
an important indicator of an environmental impact 
(e g. Warwick 1986; Warwick & Clarke 1991). 

The most straightforward way of designating 
common and rare species is in relation to their 
position on a rank abundance plot. However, it 
is not always obvious where to draw the bound¬ 
ary between common species and the rare ones. A 
number of approaches have been suggested. One 
option is to assign a pre-determined fraction of 
species to rare and common categories. For exam¬ 
ple, the lower and upper quartiles in a ranked abun¬ 
dance distribution could be used to identify rare 
and common species, respectively (Gaston 1994). 
The disadvantage of this is that some assemblages, 
such as the tropical arthropod ones mentioned 
above, have large numbers of rare species (Cod- 
dington et al. 2009). A further difficulty is that 
the quartile approach can classify different species 
as rare, depending on the currency used. Cur¬ 
rency issues also apply to other categorizations of 
rarity. 


A second method is to adopt absolute definitions 
of rarity, such as singleton status (Magurran 2004). 
However, as already seen, the presence of sin¬ 
gletons may, to a large extent, be a reflection of 
thoroughness of sampling. By the same token the 
absolute abundance of the most abundant species 
will usually be a measure of sampling effort. 
Complete censuses of ecologically meaningful enti¬ 
ties, such as a pond or tree hole community, are one 
occasion on which the use of absolute abundances 
would be illuminating. 

The third approach is to take account of the 
ecology of the species, and in particular the 
extent to which they are core members of the 
assemblage (Southwood 1996). Magurran and Hen¬ 
derson (2003) found a bimodal distribution of 
species occurrence in an estuarine fish commu¬ 
nity. The study has extended over nearly 30 years, 
with monthly sampling. Some species—the core 
species—are present in the majority of years in the 
record. These taxa tend to be abundant. In con¬ 
trast the occasional species, that is those that appear 
infrequently, have low abundance. Core species 
are fish that are associated with estuarine habi¬ 
tats whereas occasional species are predominantly 
found in other habitats, such as rocky shores. Simi¬ 
lar patterns have been detected in other systems, for 
example forest Hymenoptera and ground beetles 
(Ulrich & Ollik 2004; Ulrich & Zalewski 2006). Inter¬ 
estingly, core and occasional species leave different 
signatures on the species abundance distribution. 
Occasional species are responsible for the long tail 
of rare taxa; the longer the duration of the sampling 
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the more prominent this tail will become (McGill 
2003a). 

A fourth, and related, method is to ask if there are 
clusters of abundance within the species abundance 
distribution. Ugland and Gray (1982) suggested 
that the log-normal distribution represents an over¬ 
laying of three grouping of organisms, a large 
group of taxa that are consistently rare, another 
smaller group of moderately common species, and 
a third group, the smallest one, with extremely 
abundant taxa (Gray 1987). Recent research reveals 
that empirical species abundance distributions can 
indeed be multi-modal (Dornelas & Connolly 2008; 
Dornelas et al. 2009). If present, these modes (and 
there may be more than three) could be used to 
identify rare and common taxa. In addition, as wifri 
the core/occasional partition above, it may be pos¬ 
sible to use ecological knowledge of the system 
to categorize species associated with a particular 
mode. 

Alternatively, one might assign species to dif¬ 
ferent categories of commonness and rarity, and 
tally the frequencies of the different types or iden¬ 
tify them on a rank abundance or other plot. 
The Rabinowitz approach (Rabinowitz 1981; Rabi- 
nowitz et al. 1986) is one option, but others are 
also possible (e.g. La Sorte & Boecklen 2005), for 
example distinguishing species that are persistently 
common, those that experience cycles of alternating 
high and low abundance, and so on. 

A final suggestion is to track the abundance or 
rank of labelled species (e.g. Murray et al. 1999; 
MacNally 2007; Collins et al. 2008). For instance, 
one might be interested in the rank of species of 
special conservation interest or those of commercial 
significance such as pests or harvest species, and 
in how this varies amongst habitats or in relation 
to climate change. Alternatively, it would be pos¬ 
sible to examine the relative abundance, through 
space or time, of subsets of species, such as the 
dominant (or rare) fraction of an assemblage. This, 
of course, requires the development of appropri¬ 
ate null models since it is important to be able to 
distinguish changes due to anthropogenic activity 
from those that reflect the underlying ecology (such 
as turnover) as well as from change that is due to 
chance or sampling error. 


8.5 Prospectus 

The characteristic species abundance distributions 
seen in nature stand out as one of the few ecological 
laws. Ecologists have been striving to explain these 
patterns for the best part of a century and have 
developed statistical, biological, and neutral mod¬ 
els for the task (McGill et al. 2007; Chapter 9). 
However, one of the obstacles that ecologists face 
in explaining natural species abundance distribu¬ 
tions is that competing models can be equally good 
at predicting empirical patterns. A way forward 
might be to revisit the Prestonian notion of com¬ 
monness and rarity, but from the perspective that 
there are different ways of belonging to one of 
these categories. Species abundance models could 
potentially meet this challenge by tracking com¬ 
munities through time and asking, for example, 
whether species at the rare end of the distribu¬ 
tion at a given time point are there because they 
have persistently low abundance, are migrants or 
occasionals, are at the downturn of a population 
cycle, exhibit a characteristic set of traits, or rep¬ 
resent a sampling artefact. Linking distributions of 
numerical abundance and biomass could also pro¬ 
vide new insights into what it means to be common 
or rare. 


8.6 Key points 

1. Species in ecological communities vary in abun¬ 
dance. Typically, a few species are common, 
some are of intermediate abundance, and many 
are rare. 

2. The conclusions an investigator reaches about 
the structure of a community may depend on 
the currency used to measure abundance and 
whether abundances are presented as absolute 
or relative values. The spatial and temporal 
extent of sampling will also shape the perception 
of commonness and rarity. 

3. High levels of rarity, such as the large num¬ 
bers of singletons seen in samples of tropical 
arthropod communities, may be the result of 
under-sampling or other sampling issues such 
as variation in detectability. This should be 
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addressed before other explanations of rarity are 
sought. 

4. Communities may be partitioned in a num¬ 
ber of ways, for example by categorizing dif¬ 
ferent types of commonness and rarity, by 
identifying separate modes of abundance in a 
species abundance distribution, or by tracking 
labelled species. Options are discussed but it is 
noted that this is an area that warrants more 
research. 


5. Historically, analyses of species abundance dis¬ 
tributions have ranked species or binned them 
by abundance. Examination of the different 
modes of rarity (and commonness) subsumed 
by such analyses could shed new light on the 
processes that structure ecological assemblages 
and help to explain spatial and temporal varia¬ 
tion in diversity. The use of different currencies 
when characterizing rare and common species 
could also be informative. 



CHAPTER 9 


Species abundance distributions 

Brian J. McGill 


9.1 Introduction 

A species abundance distribution is a quantita¬ 
tive statement about the difference in abundances 
between species in a community. In short, it is 
a statement about the commonness and rarity of 
species (see Chapter 8). As such, it cuts to the central 
questions of basic ecological research (the study of 
the distribution and abundance of organisms: Krebs 
1972; Andrewartha & Birch 1984) and of conserva¬ 
tion biology (defined by Soule as the science of rar¬ 
ity: Soule 1986). Thus it is not surprising that species 
abundance distributions are among ecology's old¬ 
est patterns, with quantitative measurements as 
far back as 1909 (Raunkaier 1909) and explanatory 
models as far back as 1932 (Motomura 1932). Every 
species abundance distribution ever measured with 
an adequate number of species (say at least three to 
five) has shown the same pattern: most species are 
rare, only a few species are common. In short, 'rare 
is common, common is rare'. 

Perhaps the most useful definition of a species 
abur. lance distribution is a vector of comparable 
abundances representing a community. The idea of 
'vector' implies that there is an abundance mea¬ 
sure for multiple species. The idea of 'compara¬ 
ble' implies that it is sensible to compare and con¬ 
trast these numbers. At a minimum comparabil¬ 
ity requires that the numbers either be true den¬ 
sities or if indices of abundance then that they 
are estimated in the same fashion across all the 
species. The idea that they represent a 'commu¬ 
nity' is the hardest to pin down. Ecologists, despite 
having community ecology as a central subdisci¬ 
pline, have never given a truly operational defini¬ 
tion to the concept of community. Here we use it 
to imply a group of organisms that overlap in time 
and space such that there is a reasonable chance 


* of interaction between the members. The tempo¬ 
ral and spatial extent can be quite variable. Spa-, 
tial extents of as little as 0.25 m 2 are common for 
grasses and milliliters are common for microbial 
organisms while communities are sometimes taken 
to cover entire continents. Similarly temporal extent 
may range from a point in time (in practice, say, a 
single day) to millions of years in paleontological 
applications. The taxonomic extent of a community 
is also quite variable (ranging from a single genus 
to entire kingdoms, e.g. all plants). The key criteria 
seems to be that there is an expectation that the 
members have the potential to interact with each 
other and the list of species is in some meaningful 
way exhaustive (species are not omitted arbitrar¬ 
ily). An extraordinary amount of data have been 
collected that fit this definition of 'a vector of com¬ 
parable abundances for the species in a commu¬ 
nity'. Nearly every dataset with multiple species 
and abundances (density or indices of abundance) 
measured qualifies. 

There is also an issue of what measure of abun¬ 
dance or what 'currency' to measure. Although 
number of individuals is by far the most common, 
this approach has limitations when applied to 
plants that show clonal growth, and it can also 
distort results in indeterminate growers, such as 
plants or fish that can vary by orders of magnitude 
of size within one species. Common currencies 
include number of individuals, percentage cover, 
biomass, and energetic use. It has been argued that 
energy use might be closer to the actual processes 
regulating community assembly and therefore 
should be preferred (Tokeshi 1999). Broadly, the 
distribution observed is the same for all of these 
currencies, but subtle and possibly important 
differences do occur (Wilson et al. 1996a; Chiarucci 
et al. 1999; Connolly et al. 2005; Morion et al. 2009). 
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What might one do with a species abundance 
distribution? A few obvious applications include: 

1. using it to test various models or theories of how 
communities are assembled 

2. using it to identify rare and common species for 
management purposes (which typically focus on 
rare species and need quantitative assessments 
of rarity to justify prioritizing some species over 
others) 

3. using it as an indicator of the ecosystem health 
of a location 

4. to compare two or more locations (or one loca¬ 
tion across time), for example how do the SADs 
of two or more communities along a productiv¬ 
ity gradient or along a human disturbance gra¬ 
dient change? Such comparisons might be done 
with any of the goals in points 1-3 in this list in 
mind. 

To date, usage 1 has probably been the most com¬ 
mon. A great deal of literature has been devoted to 
testing theoretical models of community processes 
using species abundance distributions. Thus Moto- 
mura (1932) used SADs to suggest niche pre¬ 
emption. Fisher et al. (1943) and Preston (1948) 
debated whether statistical arguments leading to 
log-series or log-normal distributions, respectively, 
were correct. More recently, literally dozens of other 
theories of community assembly have been tested 
by species abundance distributions (Hughes 1986; 
Tokeshi 1996; Dewdney 2000; Hubbell 2001; Pueyo 
2006; Harte et al. 2008). Unfortunately, species 
abundance distributions have shown little power to 
either reject theories (Popper 1959; Platt 1964) or dif¬ 
ferentiate between them (Wilson 1993; Chave et al. 
2002; McGill 2003a,b). Even when it can be deci¬ 
sively shown that one theory or model fits a par¬ 
ticular location better, very similar locations with 
very similar groups of organisms are usually fit by 
a different theory best. 

For this reason there has been a call to move 
away from using SADs for point 1 above, and 
to move towards a more comparative approach 
(point 4)(McGill et al. 2007). In general, ecology, and 
indeed all of science, works better with some form 
of comparison, whether observational or experi¬ 
mental. It is not reasonable to ask-physics to predict 
that a cloud of gas will condense to form a ball of the 


exact size and location that we now know as Earth 
(Roughgarden 2009), but it is reasonable to ask 
physics to predict what will change in the Earth's 
motion if a given gravitational attraction from the 
Sun is applied. Similarly, it may be more realistic 
to expect ecology to predict how the relative pro¬ 
portions of rare and common species will change as 
productivity is increased or logging occurs than to 
predict the precise relative proportions of the abun¬ 
dance of each species at a given location and point 
in time on Earth. 

The 1970s saw a burst of research in this direction. 
A number of authors explored how SADs changed 
shape along a variety of gradients, such as succes¬ 
sion or productivity (Whittaker 1965; Bazzaz 1975; 
Hubbell et al. 1999). Fairly strong patterns were 
suggested. For example, SADs become more even 
with higher productivity. Work on using SADs as 
ecosystem indicators was also done (Gray 1979), 
clearly showing identifiable changes to marine 
communities as a consequence of pollution. At that 
time all of the comparisons were done graphi¬ 
cally (i.e. by eye-ball). Rank abundance distribu¬ 
tion (RAD) plots (log abundance vs rank order) 
were plotted and their shapes were compared. 
Unfortunately, this research programme died out 
quickly without significant confirmation of any of 
the identified patterns (perhaps in part because 
they were incorporated so quickly into a promi¬ 
nent textbook and were taken as being well estab¬ 
lished; Whittaker 1975). In any case, relatively lit¬ 
tle effort has been made since on comparing SADs 
to inform about underlying processes or manage¬ 
ment implications (although exceptions exist, e.g. 
Wilson et al. 1996a). 

9.2 State of the field 

Whatever one's goals and whether one uses a 
single SAD or is comparing SADs as advocated 
above, there are four basic types of tools to use: 
visual, parametric, non-parametric, and multivari¬ 
ate. These are described in the following sections. 

9.2.1 Visual approaches to SADs 

The simplest way to use SADs is to plot them 
graphically. The ease with which this is done is 
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undoubtedly one of the main reasons for their 
appeal. There are three main ways to plot the data. 
The first is to create a histogram. Since the num¬ 
ber of unique abundances observed will be a large 
fraction of the number of species observed, his¬ 
tograms require aggregating or binning the data. 
There has been great controversy about the best bin¬ 
ning method. The simplest way to plot the data is 
to use an arithmetically scaled axis (Fig. 9.1(a)). Pre¬ 
ston (1948) argued for a logarithmically scaled axis 
(Fig. 9.1(b)) and introduced a method that involved 
smoothing the data (splitting the number of species 
found on bin boundaries across two bins). Some 
have argued for a binning that accounts for the 
strong skew of the data by lumping the extreme 
right tail of the data into a single very large bin 
on an arithmetic axis, which is sometimes referred 


to as the N +1 method (Nee 2003). Unfortunately, 
the results and interpretation of the graphs depends 
heavily on the binning method used (Gray et al. 
2006). In particular, the interpretation of the num¬ 
ber of rare species depends on whether an arith¬ 
metic or logarithmic scale is used and which exact 
logarithmic scale. More generally, binning repre¬ 
sents a loss of data, which is a statistical no-no 
when avoidable. More modern techniques for esti¬ 
mating probability density functions (PDFs) such 
as kernel estimation (Martinez & Martinez 2002) 
could and probably should be used in preference 
to binning (see the curved line on Fig. 9.1(a) and 
(b)). As far back as 1957, MacArthur pointed out 
these problems and suggested an alternative plot¬ 
ting method, later championed by Whittaker (1965), 
which has become known as the rank abundance 



Figure 9.1 Four different ways to plot species abundance data: (a) a histogram (binned) plot on an arithmetic scale; (b) a histogram plot on a log scale 
(in both (a) and (b) the curved line represents a kernel-smoothed representation of the data); (c) a classic rank abundance distribution; (d) a scaled 
empirical cumulative distribution function plot (see Box 9.1). The curved line represents a logistic curve fit to the data. This data and the data used in 
Figure 9.3(a) and (b) are based on trees >1 cm dbh on transects run at Mt St Hilaire nature preserve operated by McGill University as a field station. The 
data were collected by students in the 2006 Field Methods in Ecology and Behavior course. 







108 BIOLOGICAL DIVERSITY 



0 10 20 
Rank 




-o 1370-1680 m 
1680-1920 m 
1920-2140 m 


Figure 9.2 Different ways to plot an SAD. This figure is a plot of the data reported in Whittaker 1960. (a) Classic rank abundance distribution (RAD) plot 
with log N on the y-axis and rank (1... S) on the x-axis. (b) The empirical cumulative distribution function (ECDF) with log N on the x-axis and proportion 
of all species on the y-axis. (c) ECDF but with the abundances transformed into percentages of all individuals observed for the given community (here N = 
2949, 3342, and 3741, respectively). 


plot (Fig. 9.1(c)). In this plot the individual species 
abundances are sorted highest to lowest. The y- 
axis is log-abundance and the .t-axis is rank (1 = 
most abundant down to S = number of species for 
the least abundant). As MacArthur pointed out, 
this plot has a close relationship to the cumula¬ 
tive distribution function (CDF) used in probabil¬ 
ity theory to characterize a probability distribu¬ 
tion. RADs appear to be easy to interpret. Whit¬ 
taker (1965) and others have interpreted RADs such 
as those shown in Fig. 9.2(a) to indicate that low 
diversity communities are geometric (Motomura 
1932), medium diversity communities are log-series 
(Fisher et al. 1943) and high diversity communities 
are log-normal (Preston 1948). 

Two problems occur with RAD plots. The first 
is mathematical. Rank is mathematically difficult to 


deal with (it invokes the field of order statistics), but 
the use of a traditional CDF plot would allow for the 
invocation of a vast array of mathematical machin¬ 
ery developed for CDFs. Secondly, RAD plots suf¬ 
fer from a severe confoundment—by definition the 
bottom right corner of a RAD occurs at S, the 
number of species in the community. Thus species 
with higher diversity will have RADs that are more 
stretched out along the x-axis (rank) and appear 
more even. This makes it very difficult to use RADs 
to provide meaningful comparisons of species dis¬ 
tributions across communities with even moder¬ 
ate variations in species richness as the differences 
in richness quickly dominate all other patterns. A 
solution to these two problems, called the empirical 
cumulative density function (ECDF), is my recom¬ 
mended way to plot SADs (Box 9.1 and Fig. 9.2). 





SPECIES ABUNDANCE DISTRIBUTIONS 109 


Box 9.1 Empirical cumulative distribution function plot 


A simple and quick method for correcting for different 
species diversities between communities is to use a new 
type of plot (Figs 9.1(d) and 9.2). Specifically, the 
cumulative distribution function is a plot of the proportion 
of points with a value (here abundance) less than a given 
value (CDF(X) = P{x < X)). This is a standard way to 
analyse probability distributions. When the CDF is 
calculated from data rather than from a formula, it is called 
an empirical cumulative distribution function (ECDF). The 
Kolmogorov-Smirnov statistic calculates the difference 
between the ECDF and the theoretical CDF (or between 
two ECDFs). 

The CDF is also the integral of the PDF. The PDFs are 
often plotted for SADs. This means peaks in the PDF 
correspond to steep upward slopes in the CDF. For 
example, the Preston plot (1948) is just a PDF on a log 2 
scale with binning (i.e. a histogram) used to estimate the 
PDF from empirical data. However, in comparison to the 
PDF, plotting the CDF has two distinct advantages: 

1. The ECDF is easily estimated. Sort the abundances into 
increasing order. Use these as the values on the x-axis. 
Place the ranks (here from lowest to highest abundance, 
the reverse of RAD) divided by the total number of 
species on they-axis. This will give values ranging from 
1/S to 1. Normally the point (0, 0) is also included to 
make the graph complete. 

2. Because it sums across data, the ECDF is usually more 
robust (less sensitive to outliers) than the empirical PDF, 

The ECDF is also closely related to the RAD (Fig. 9.2). 
Specifically, the ECDF is generated by flipping the RAD 
about they = x line, then rescaling the rank by dividing by 
S, then inverting the rank axis (smallest to largest instead of 
largest to smallest). The ECDF has two distinct advantages 
over the RAD: 

1. By rescaling by 5 the inordinate sensitivity to differences 
in S allows for useful comparisons to be made between 
sites. 

2. By tapping into traditional mathematical machinery, it 
should be possible to import new tools into the analysis 
of SADs. 

The improved comparative power of an ECDF over a RAD 
is demonstrated in Fig. 9.2. The data are from vegetation at 
different elevations in the Siskiyou mountains (Whittaker 
1960). As noted in the main text, this and related data 
plotted as in Fig. 9.2(a) were used to argue that SADs are 
fundamentally different at different elevations, 
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Figure 9.3 Comparisons of SADs using the ECDF plot, (a) Four 
different plots compared. Data as in Fig. 9.1. We can see that the site 
at Orange Trail is very different in evenness (the evenness is much 
higher since the line is more sharply vertical, implying that most species 
have an abundance of about 4% of the total population). The other 
sites are broadly similar in this aspect of evenness with species spread 
out between 0.1 and 10%, but SouthSlope overall has a lower 
proportion of rare species (the line is shifted down, implying that a 
given percentile—say 50% of species read on the vertical axis—occurs 
at a higher proportional abundance, roughly 4% instead of 1%). (b) 
Two groups of species compared. Here data are summed across all four 
sites and one SAD is plotted for species primarily found in the canopy, 
and one SAD is plotted for species primarily found in the subcanopy 
(shade or early successional). Here it is clear that the subcanopy has 
more rare species (higher on the left side of the graph). 

Continued 
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Box 9.1 ( Continued) 

productivities, and diversities (Whittaker 1965), However, 
the ECDF in Fig. 9.2(b) gives a much more realistic depiction 
of how different the SADs are due directly to elevation and 
productivity rather than the indirect effect these factors 
have on species richness ( S ). Fig. 9.2(b) is also suggestive 
of greater evenness in the lowest site (perfect evenness 
shows in the ECDF as a step function or equivalently in the 
PDF as a delta function with all abundances concentrated 
at a single value). However, this pattern is nowhere near as 


9.2.2 Parametric approaches to SADs 

The use of the probability-based ECDF immediately 
suggests that viewing a SAD as a sample from a 
probability distribution might be useful. To have 
a reasonable chance of fitting the data, the prob¬ 
ability distribution needs to be defined only for 
positive numbers and to be strongly skewed with 
a long right tail on an arithmetic scale. The 1940s 
saw two suggested probability distributions that 
met these requirements: Fisher's log-series (1943) 
and Preston's log-normal (1948). Both fit the data 
reasonably well with the main difference between 
the two occurring in the prediction for extremely 
rare species (species with abundances of 1 or 2) 
(Chapter 10). Preston and Fisher both argued that 
their approach was superior and this argument has 
continued unresolved to the present day, with the 
added complexity of literally dozens of different 
probability distributions proposed (for extensive 
lists see Marquet et al. 2003 and McGill et al. 2007). 
It seems likely that SADs do not contain enough 
data to decisively decide which one probability dis¬ 
tribution is the best fit (if indeed there is such a 
thing as one distribution to describe all communi¬ 
ties) (McGill et al. 2007). 

Some probability distributions commonly 
applied to species abundance distributions are 
described in Box 9.2. Each distribution will have 
several (usually one to three) parameters that 
can be chosen to make the general probability 
distribution fit a particular dataset (see Chapter 10). 
These parameters can be interpreted in one 
of two ways. First, it can be assumed that the 


strong once species richness is adjusted for. Indeed, to our 
knowledge, Fig. 9.2(a) has never invoked a call for the need 
to place confidence intervals around the SADs since the 
effects are so strong, while Fig. 9.2(b) dearly demands an 
assessment of significance. On the other hand, the ECDF 
can clearly discriminate between communities. In Fig. 9.3 
the raw data (with its noise) is replaced by a logistic 
function fit to the data. Note that fitting such a function is 
tantamount to modelling the data as sampled from a 
log-logistic probability distribution. 


underlying probability distribution and any model 
that produced this distribution is correct. In this 
case the parameters can be interpreted in this 
context. For example, the migration parameter, 
m, and the speciation parameter, 9, in neutral 
theory can be estimated to provide the best fit 
to the neutral theories predicted probability 
distribution (the zero-sum multinomial or ZSM) 
and then interpreted according to their mechanistic 
definitions. Alternatively, these parameters can 
simply be seen as geometric descriptors of the 
shape of the species abundance distribution 
without assuming the underlying model is 
necessarily true or applicable. In this scenario, 
for the ZSM, m can be seen as a descriptor of the 
shape of the left tail of the SAD and the number 
of rare species, while 9 can be seen as a scaling 
parameter that broadly affects the overall shape 
and the right tail (McGill et al. 2006). Under either 
interpretation, these metrics can broadly be called 
'parametric' as they are derived as the parameters 
of a mathematical probability distribution that 
summarizes the shape of the SAD. 

9.2.3 Non-parametric approaches to SADs 

Many other numbers or metrics can be used to 
describe the shape of a SAD that do not derive 
from a probability distribution. Some of these are 
intended to measure a specific property of the SAD, 
such as the diversity or the evenness. Others are 
just geometric descriptors of the shape of the SAD. 
An example of the latter is the slope of the RAD 
(Nee et al. 1992). A number of these measures are 
summarized in Box 5.1. 
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Box 9.2 Parametric measure of SAD shape 


Parametric measures are the parameters of a 
mathematically described probability distribution. Such 
parameters may be interpreted either under the assumption 
that the probability distribution applies to the data or 
merely as a useful descriptor of the shape of the data. In 
the following list, the name of the distribution is given with 
its parameters following in parenthesis. After some brief 
comments, a best-practice method for estimation is 
referenced and the formula for the distribution is given. 
Some common and useful distributions include the 
following: 

• Pareto/power (c): The Pareto or power distribution is 
based on p(r?)oc1 /n c . The Pareto can be justified as the 
limit of the log-normal distribution with high variance 
(Montroll & Shlesinger 1982), as the sole distribution 
surviving basic transformations (Mandelbrot 1963), as a 
sign of fracticality (Mandelbrot 1982), or when c = 1 as 
a sign of self-organized criticality (Pueyo 2006). Most 
SADs fail to be well fit by the Pareto. There are several 
variants of this distribution and parameter estimation can 
be tricky (White et al. 2008). I recommend the Discrete 
Pareto which runs from 1 to N. A simple analytical 
maximum likelihood estimator exists (if using software 
that can calculate the harmonic function, £(c)). 

p(n) = n~ c / C(c) 

• PowBend (/?, co): Pueyo (2006) suggested that the 
Pareto distribution should apply to SADs but failed 
because of sample size limitations, in particular the very 
common species are not accurately represented in their 
true proportions due to finite sample size. He derived a 
modified power distribution (the power-bended 
distribution or PowBend) that bends the distribution 
down to have fewer large species to represent this. 
Nominally, the parameter /3 is the same as the Pareto c, 
while cjj represents the degree of bending. In practice co 
ensures a fit to common species, which allows jS to fit the 
rare species portion of the SAD well. Pueyo recommends 
a four-step programme: 

1. Fit a 1 /x distribution (i.e. power distribution with 

c = 1 or, equivalently, the powbend with co = 0 and 

(3=D. 

2. If the fit is poor, allow c to vary in the power 
distribution (or equivalently vary j8 with co = 0). 

3. If the fit is still poor, switch to the PowBend and allow 
co to also vary. 

4. If this still fails, a log-normal can be fit. 


These four steps correspond to using the first to fourth 
terms in a Taylor power series, and can be used to detect 
the complexity of the community. Parameters can be 
estimated by numerical maximization of the likelihood 
surface, p (n) = CrT^expf-con) 

• Zipf-Mandelbrot (b , c): Another modification of 
the power distribution is the Zipf-Mandelbrot 
distribution, which contains the power distribution as a 
special case when b = 0. It was first applied to word 
frequencies and in ecology is interpreted as a sort of 
successional model where later species have increasingly 
complex ecological requirements and as a result are extra 
rare (Frontier 1985). In practice, unlike the Pareto with 
its excessively rigid one-parameter shape, but like the 
PowBend, the Zipf-Mandelbrot has a greatly improved fit 
with two parameters, allowing one parameter (c) to 
describe fit to common species and the other ( b) to 
describe fit to rare species. Fitting the Zipf-Mandelbrot 
can be tricky (Izsak 2006). I recommend minimizing a x 2 
goodness-of-fit only on abundances >5 with a careful 
choice of initial values to speed convergence, p (n) = 

(b + n) c IH{N,b,c) where H(N,b,c) is a normalizing 
constant and is an extension of the £ function where 
H(N, b,c)= X(/ + b)~ c and the sum goes from 
/' = 1.. .A/. 

• Log-normal (p, ay.*, a *, cv, cv*): The log-normal 
distribution arises from the central limit theorem on a 
product of random variables (Limpert et al. 2001). A 
variety of biological interpretations can fit this description 
(McGill 2003a; McGill & Nekola 2010). The two 
parameters are the mean, y, and the standard deviation, 
a, of the (usually natural) log-transformed abundances. 
However, y* = exp(/u.) is often reported because y* is 
in units of abundance. Some also transform 

a* = exp(cr) to return a to units of abundance, but this 
is less common. Clearly yi is a scale parameter (position 
on the x-axis of abundance), but shape depends on the 
interaction between y and cr. The coefficient of variation 
(cv = a/y or cv* = a*/y*) serves as a shape 
descriptor, with cv < 1 indicating a nearly normal shape 
(even on an arithmetic scale) and the skewness 
increasing as the cv increases. Parameter estimation is 
trivial (simply log transform and take the mean and 
standard deviation). 

p(n) = exp[— log(r?//x) 2 /(2cr 2 )]/[r?ff(277-) ,/2 ] 


Continued 
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Box 9.2 ( Continued) 

• Poisson log-normal (^, a , ft*, a*, cv, cv*): The 

main problem with the log-normal is that it is a 
continuous distribution allowing for fractional 
abundances. One approach is to develop a sampling 
model (Pielou 1975; Dewdney 1998; Green & Plotkin 
2007) (see Chapter 10) where the abstract abundance of 
a species is given by A sampled from a log-normal, but 
the observed abundance is sampled from the Poisson 
distribution with the one parameter (the average value) 
given by A. Parameter estimation requires numerical 
maximization of the likelihood surface and can be 
computationally intense as well as requiring some good 
numerical approximations (Chapter 10, Bulmer 1974). 

/ OO 

0 p LN (A)dA where p LN (A) is the 
log-normal probability (see above) 

• Gamma (scale, shape): The gamma distribution is 
another continuous probability distribution on the range 
(0, oo). It is broadly used in mathematics as a general, 
flexible distribution. Unlike the log-normal, its two 
parameters are clearly divided into a scale parameter 
(position on the x-axis) and the shape parameter, with 
shape again varying from modal to a single mode near 
N = 0. Both the gamma and the Weibull distribution 
(next) are mathematically more tractable than the 
log-normal. The gamma distribution is not frequently 
used for SADs, but occasionally it is used and in some 
cases found to be the best-fitting distribution (Plotkin & 
Muller-Landau 2002). There is a simple formula for 
maximum likelihood estimation (Evans et al. 1993). 
p(n) = (n/scale) shape ''exp(-/7/scale)/(r(/7)scale), where 
F(n) is the gamma function. 

• Weibull (scale, shape): The Weibull is another widely 
used continuous probability distribution defined on 
(0, oo) and therefore sensible for modelling abundances. 
Again there is a simple MLE formula for estimation 
(Evans et al. 1993). 

p(n) = shape x (n/scale) shape ' 1 exp[-(n/scale) shape ]/scale 
• NegBin ( P,K ): Just as one can make a Poisson 
log-normal to represent integer sampling from an 
abstract continuous distribution, one can make a Poisson 
gamma distribution. This is called the negative binomial 
distribution. Traditionally, the distribution is zero 
truncated (the probability of sampling abundance 0 is 
removed by rescaling). The negative binomial has several 
different parameterizations and can be confusing. I 
recommend following the one given by Pielou (Pielou 
1975, p. 35-38) along with her estimation method. In 
this case, P is the scale parameter of the gamma 


distribution and k is the shape parameter. One 
interpretation is that k measures inverse distance from 
the Poisson distribution (k = oo gives the Poisson while 
k = 0 is far from Poisson). Thus k has been used in 
studies of abundances in quadrats to indicate the degree 
of clumping, but it is unclear how this relates to SADs. 
n / n \ _ Tm ( p \ n i 
pW -ZTU)Vi7e) (uM-i 

Log-series (c): The log-series is the limit of the negative 
binomial as k goes to zero (with some mathematical 
details added) and the zero abundances are truncated 
(Fisher et al. 1943). It has a single parameter, c. Fisher's 
a is technically not a parameter of the distribution but is 
instead an implicit function of the parameters, 5 and N 
(specifically 5 = a ln[1 + N/a]). Fisher's a is generally 
recognized as a measure analogous to species richness, 
although in different units. Estimation of c can be done 
by numerically solving for c in the equation 

(l-Oto9(l-0 -P^ = k > Where 


n, = 


k = —1/ln(1 -c) 

• ZSM (m, 9): The birth-death-immigration equations of 
neutral theory can be solved to produce what is known 
as the zero-sum multinomial (ZSM) distribution (Hubbell 
2001; Etienne 2005), although the zero-sum assumption 
is not critical (Etienne et al. 2007). It has two parameters. 
Nominally 9 is a speciation rate and m is the percentage 
of individuals migrating from outside the local 
community. In practice these values are impossible to 
measure and 6 and m are fit by maximum likelihood 
estimation with the likelihood calculated using complex 
combinatoric methods (Etienne 2005). This optimization 
is made even more tricky because there is a long ridge of 
nearly equal likelihood (Etienne et al. 2006). The 
parameter 6 is strongly correlated with species richness 
(and is equal to Fisher's a when m = 0, in which case 
the ZSM is the same as the log-series). p(n) has a 
complex representation (Etienne & Olff 2004; Etienne 


2005). 


• Geometric (r): The first used probability distribution for 
SADs is the geometric, but ironically it is not actually a 
probability distribution but a series. The proportion of 
abundance in each species is a function of its rank, not a 
random sample from a distribution, and is different from 
the geometric probability distribution (May 1975). 
Flowever, it is often still treated as a probability 
distribution. It has a single parameter, often called k or r. 
This represents the fraction of the remaining community 
captured by each successive species. Several approaches 
for estimation are available (He & Tang 2008) of which I 
find He and Tang's new method the best. 
n, = C(k)NkO - <c)' _1 , where C(k) is a 
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normalizing constant C(k) = 1 /[I - (1 - k) 5 ], N is total 
abundance and S is species richness. 

in summary two groups of distributions emerge. One 
group contains the Pareto distribution with extensions to 
create the PowerBend, Zipf—Mandelbrot and the 
conceptually distinct but mathematically related log-series. 
The second group contains three continuous functions 
(log-normal, y and Weibull) and their Poisson sampling 
extensions (Poission-Lognormel, Poisson—Weibull, not 
discussed here, and Poisson Gamma = negative binomial). 
The ZSM and geometric stand alone. There are numerous 
(probably dozens) of other probability distributions that are 


9.2.4 Multivariate approaches to SADs 

The final approach to dealing with the vector of 
numbers measured in an SAD is to treat it in 
its full multivariate nature. This is rarely done 
with a single SAD (but see, for example, Alonso 
2005). With multiple SADs it becomes possible to 
define measures of dissimilarity distance between 
communities (over 70 different distance measures 
have been defined—Legendre & Legendre 1998; 
Chapter 6—and perform cluster analysis, ordina¬ 
tion and other techniques- Clarke & Warwick 2001; 
Manly 2004). For an example see Figure 17.4. This 
approach shares with the visual approach the ben¬ 
efit that no data are thrown away. The use of one 
or a few metrics (parametric or non-parametric) as 
described in sections 9.2.2. and 9.2.3 involves data 
reduction, and the multivariate approach has the 
benefit over the visual approach of being quantita¬ 
tively rigorous. Yet the multivariate approach has 
had limited uptake. This is probably largely due to 
the complexity and training required to correctly 
perform multivariate statistics. One of the major 
benefits and reasons for frequent use of SADs is 
their easy-to-grasp nature. Moreover, our intuition 
starts to fail in a multivariate world, for exam¬ 
ple the definition of order falls apart in a multi¬ 
variate world, but the first three approaches allow 
for unambiguous statements like there are more 
rare species in data set X or data set Y is more 
even. 


not mentioned here (Marquet et al. 2003; McGill et al. 
2007). The power fraction extensions of the geometric 
(Tokeshi 1996) are interesting, but currently have no 
analytical approaches and must be solved by repeated 
simulation. The log-logistic distribution (the logistic 
distribution on log abundances) is a little known but valid 
distribution and is essentially the distribution one assumes 
when fitting a logistic curve to the ECDF (Fig. 9.1 and 
section 9.2.1). Since it is primarily a descriptor of visual 
shape I treat it as non-parametric (Box 5.1), but it has 
sometimes been argued as a probability distribution 
outright relevant to SADs (Williamson & Gaston 2005). .. 


9.3 Identifying a useful, parsimonious 
subset of SAD metrics 

If one wishes to use the visual or multivariate 
approaches, then the large number of metrics of 
SADs is not a problem. Otherwise, the prolifera¬ 
tion of dozens of metrics to describe SADs is prob¬ 
lematic. Indeed, for SADs with only a few (5-15) 
species, there are more metrics that can be calcu¬ 
lated on it than there are raw data points. Thus there 
must be redundancy among the diverse metrics. 
There have been some attempts to identify some 
metrics as superior to others (Hurlbert 1971; Kemp- 
ton & Taylor 1974; Smith & Wilson 1996; Wilsey 
et al. 2005). However, a truly systematic evalua¬ 
tion of both parametric and non-parametric met¬ 
rics on SADs is lacking. I seek to identify a subset 
of all metrics that can usefully but parsimoniously 
describe the essential features of an SAD. 

I suggest that there are four properties that a good 
set of metrics should have: 

1. Efficiency: No metric will work perfectly on 
small samples (the shape of the SAD is meaning¬ 
less with one individual and is nearly as mean¬ 
ingless with 10). However, good metrics should 
quickly converge to their true values as sample 
size increases and the standard error should like¬ 
wise quickly decrease as sample size increases. 

2. Unbiased: A good metric will not consistently 
undershot or overshoot the true value when the 
sample size is small. In other words, the metric 
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should be on average equal to the true value 
when many different samples are compared. 

3. Interpretability: A good metric should be intu¬ 
itive. It should describe geometric aspects of the 
SAD that can be easily visualized or that describe 
factors of the SAD which are biologically rele¬ 
vant or ideally both. Evenness, in its qualitative 
sense, is an example that has a clear geomet¬ 
ric interpretation (close to the horizontal line in 
RAD or close to the step function ECDF) as well 
as a clear biological relevance. 

4. Orthogonality: The first three are properties of 
individual metrics. Orthogonality is a property 
of sets of metrics. A set of metrics should contain 
little overlap or redundancy in what the mem¬ 
ber metrics measure. In short, the collinearity or 
correlation between the different metrics should 
be low. Thus species richness, S, and Shannon 
diversity, H, have a lot of redundancy, as do H 
and Shannon evenness, /, but S and / are mod¬ 
erately orthogonal (although it has been shown 
that even / depends on S and evenness mea¬ 
sures other than / show greater orthogonality to 
S, e.g. Smith & Wilson 1996). I suggest that SAD 
metrics are currently in the state that landscape 
metrics were in before Riiters and colleagues 
(1995) performed their analysis. Over 50 land¬ 
scape metrics had been suggested, but a factor 
analysis showed that all the metrics were well 
described by only six underlying factors. 

Factors such as dependency on sample size and 
orthogonality can be calculated analytically for spe¬ 
cific individual metrics or pairs of metrics (Hurlbert 
1971; May 1975; Smith & Wilson 1996), but such 
analytic results are not generally available for most 
of our proposed metrics, thereby necessitating an 
empirical approach. To assess which subset(s) of 
metrics out of all parametric (Box 9.2) and non- 
parametric (Box 5.1) meet these goals, I performed 
two simple analyses. 

9.3.1 Efficiency and bias 

The first analysis assessed bias and efficiency. I 
took five empirical datasets with more than 10 000 
individuals ( N ): Barro Colorado Island tropical 
trees (Condit et al. 1996), Winemiller's Venezuelan 


fish (Winemiller 1990), Fisher's Malaysian moth 
light trap data (Fisher et al. 1943), Hick's cope- 
pod data (Hicks 1980) and one site from Pear¬ 
son's data on macrobenthos in Loch Linhe (Pearson 
1975). Ten replicate subcommunities each of sizes 
N = 30,100,1000 and 10 000 were randomly sam¬ 
pled from the original community. All 52 metrics 
(32 non-parametric, 20 parametric) were calculated 
on each replicate subcommunity and also on the 
original full community. The latter was assumed to 
be the true value of the metric for each dataset. Esti¬ 
mated metrics were averaged across the 10 repli¬ 
cates and then calculated as a ratio to the true met¬ 
ric for the full community (so a value < 1 indi¬ 
cates smaller than the true value, > 1 indicates 
greater than the true value). The results are found 
in Figs 9.4 and 9.5 and summarized in Tables 9.1 
and 9.2. 

Overall, it can be seen that small sample sizes 
give very misleading estimates. Most metrics only 
began to come close to their true value (even to 
within ±50%) with at least 1000 individuals sam¬ 
pled and in many cases only with 10 000 individ¬ 
uals sampled. This is problematic as a vast major¬ 
ity of studies of SADs are done with at most a 
few hundred individuals. Most metrics are also 
biased. Among the parametric measures, those 
related to the power distribution (Pareto/Power, 
Zipf-Mandelbrot and especially the log-series) do 
the best. Among the non-parametric measures, the 
classic diversity measures (the Shannon diversity 
and the closely related pair of Simpson diversity 
and Hurlbert Probability of Interspecific Encounter 
diversity measures) have nearly perfect perfor¬ 
mances converging to the true answer with only 
a few dozen individuals and being unbiased. 
The equivalent evenness measures do not fare 
as well, although the Shannon evenness remains 
one of the best evenness measures. The non¬ 
absolute dominance measures (relative dominance 
and McNaughton dominance), the percentage rare 
measures and the intercept from the logit on the 
ECDF (iiogit) all do fairly well (although to have 
any species that are rare at the 1% level several 
hundred individuals are needed). Overall the mes¬ 
sage of this analysis is a sobering one—large sam¬ 
ples in the order of 1000 individuals are needed to 
truly assess species abundance distributions. With 
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Figure 9.4 Efficiency and bias of parametric metrics. Each line represents the average across 10 replicate draws from a single dataset. The lines give the 
ratio to the true value, with 1 (the dashed line) representing a perfect match. Values greater than 1 represent overestimates. The x-axis represents sample 
size, N. 


smaller samples, one is best to limit analysis to 
the Simpson and Shannon diversity measures, the 
dominance and percentage rare measures and the 
log-series c and Zipf-Mandelbrot c measures. 

9.3.2 Independence of measures 

The second analysis seeks to reduce the dimension¬ 
ality and find orthogonal metrics. Datasets repre¬ 
senting 91 communities were analysed, including 
21 from Morlaix (different points in time before and 
after a disturbance: Dauvin 1984), the aforemen¬ 
tioned Hicks copepod data, the aforementioned 
BCI data divided into 10 strips of 5 ha each, 13 
communities of macrobenthos from a succession 
experiment (Arntz & Rumohr 1982), 30 communi¬ 
ties of macrobenthos from Loch Linhe and Loch 
Eil (Pearson 1975), 20 randomly chosen routes 
from the North American Breeding Bird Survey 


with at least 400 individuals (Robbins et al. 1986; 
Patuxent Wildlife Research Center 2001), and the 
aforementioned Winemiller fish and Fisher moths. 
The last three datasets (22 communities) were not 
exhaustively sampled but instead depended on 
individuals that approached the counting device 
or observer. The first five datasets (69 communi¬ 
ties) were all exhaustively sampled (every indi¬ 
vidual in a well-defined area was identified and 
counted). In addition 55 randomly generated com¬ 
munities (samples from a probability distribution) 
were created, composed of five replicates of 11 dif¬ 
ferent distribution/parameter combinations (four 
sets of parameters on the ZSM, three sets of para¬ 
meters on the log-series and four sets of parame¬ 
ters on the Poisson log-normal). These communi¬ 
ties were analysed separately (exhaustively sam¬ 
pled, non-exhaustively sampled and random) and 
collectively by principal component analysis and k 
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Figure 9.5 Efficiency and bias of non-parametric metrics. Interpretation as in 

means and UPGMA clustering where the 52 metrics 
were treated as measures and the 146 communities 
were treated as observations. The results are shown 
in Fig. 9.6 and summarized in Tables 9.1 and 9.2. 
Simple univariate correlation of all measures with 
S and N were also calculated (Table 9.3). In general, 
results from the non-exhaustively sampled commu¬ 
nities were less robust than those from sampled 
communities. It would appear that it is highly desir¬ 
able to use exhaustive sampling methods whenever 
possible. 

Many measures (Table 9.3) were strongly corre¬ 
lated with S or N or both (echoing the results of the 
previous section on effects of sample size). How¬ 
ever, several measures were largely independent of 
both S and N, indicating that there is indeed addi¬ 
tional information contained in^ADs. The metrics 
most independent of S and N include the slope 
of the logistic intercept (both the standard, mi ogit , 
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Fig. 9.4. 

and the generalized logistic, m gen i og ), the migration 
parameter, m, of the ZSM and log-skewness. The 
logistic slopes can be seen as a measure of evenness 
while m and skewness more specifically describe 
proportions of rare species. 

The clustering (see Fig. 9.6) produced several 
useful results. First, three distinct groups of met¬ 
rics emerged. These groups were robust across 
all subsets of data, different measures of dis¬ 
tance and methods of clustering (hierarchical or K 
means). One group had measures broadly associ¬ 
ated with species richness, including richness per 
se and various estimators of it as well as diver¬ 
sity measures and Zipf-Mandelbrot b. Another 
group included evenness measures, including tra¬ 
ditional evenness measures, shape parameters of 
the log-normal, Weibull, gamma and negative bino¬ 
mial distribution, and the slope of the ECDF logit 
(f«iogit) and Gambin a. The third group included 





























Table 9.1 Scorecard of performance for non-parametric metrics. 
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Bias is indicated by sign (+ indicates consistent overestimation) with 0 bias being best and in bold. Efficiency represents the approximate sample size required to get an estimate dose to the true, full community value; 
lower values are best and in bold. Metrics that are independent of both S and N are ranked as VI (= very low), low (L), medium (M), high (H) and VH (= very high) with VI and L in bold. Group indicates which of the 
seven basic categories of measurement the metric is found in. Complexity indicates how difficult the metric is to calculate (L = low, M = medium, H = high). This takes into account the complexity of the formula, 
the computer skills required and the computational time required. PIE, RAD, NHC 
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Table 9.2 Scorecard of performance for semi-parametric and parametric metrics. 



Metric 

Traditional 

symbol 

Proposed 

symbol 

Bias 

(+/0/— ) 

Efficiency 

Tie to S 

and N 

Group 

Complexity 

Semi- 

Fisher's a 


a 

0 

10000 

VH 

S 

M 

parametric 










Log-normal CV 


LNCV 

0 

1000 

L 

E 

L 


Proportional LN /li* 


%LNpi* 

++ 

10000 

H 

E50 

L 


Proportional LN CP 


%LNCT* 

- 

1000 

H 

N 

L 


Proportional LN CV 


%LNCV 

0 

1000 

H 

N 

L 


Gambin a 


GGamBin 

0 

1000 

M 

E 

H 


Slope of logit at inflection 


^logit 

+ 

10000 

VL 

E 

L/H 


Intercept of logit at inflection 


f'logit 

0 

1000 

VH 

E50 

L/H 


Slope of generalized logit at 


/^genlog 

+ 

10000 

VL 

? 

L/H 


inflection 









Intercept of generalized logit at 


^genlog 

0 

1000 

H 

E50 

L/H 


inflection 









Assymetry of generalized iogit at 


dgenlog 

0 

10000 

L 

? 

L/H 


inflection 









Pareto power c 

c 

C 

++ 

1000 

M 

R 

M 


PowerBend jS 

P 

B 

++ 

10 000 

M 

R 

H 


PowerBend a> 

O) 

CO 

++ 

10000 

VL 

D 

H 


Zipf—Mandelbrot b 

b 

b 

++ 

10000 

H 

D 

H 


Zipf—Mandelbrot c 

c 

CZM 

0 

1,000 

H 

C 

H 


Log-series c 

X, c 

Qs 

- 
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M 

N 

M 


Log-normal /lo* 

/it*, scale 

M* 

— 

100000 

M 

N 

L 


Log-normal a 

CT,shape 

a 

— 
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H 

N 

L 

Parametric 
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L 

E 

H 
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- 
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L 

N 

H 


y scale 
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— 
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VH 

N 
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L 

E 

L 
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b 
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H 

N 

L 
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c 
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M 

E 

L 
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P 
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H 

N 

M 
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k 

0 
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M 

E 

M 


ZSM m 

m 

m 

++ 
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VL 

R 

H 


ZSM 9 

e 

© 
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VH 

S 

H 


Geometric r 

r, k 

r 
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H 

N 

L 


Same notations as Table 2. A complexity of L/H means that it can be easily assessed visually (L) but to compute an actual metric is of high (H) complexity. CV, LNCV, ZSM, 
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Figure 9.6 Hierarchical tree of 52 metrics (parametric and non-parametric) to describe species abundance distributions. UPGMA clustering based on 
Spearman Rank correlation distances shown here. Three major groups N/C, E, R/S/D emerge and are repeatable across many datasets and methods. The 
subgroups (N vs C and R vs S vs D) are less consistent but are useful divisions. 
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measures broadly related to total community size 
N, including N itself, scale parameters for the 
log-normal, Weibull, gamma and negative bino¬ 
mial, the standard deviations (<j) of the various 
log-normal distributions, the dominance measures, 
Zipf-Mandelbrot c and, surprisingly, Smith-Wilson 
evenness. The existence of these three groups was 
robust. The relationship between the three groups 
(which two were most closely related) was not 
robust, varying with data and methods. Similarly, 
several subgroups within the three groups emerged 
(Fig. 9.6). These subgroupings were moderately 
robust across data and methods, although individ¬ 
ual measures did switch between subgroups, and 
the relationships between subgroups often varied. 
However, because of the overall intuitive sensibil¬ 
ity of these subgroupings I am inclined to regard 
them as real pending further studies. Specifically, 
three measures of dominance (abundance of com¬ 
mon species and here grouped under the label C) 
always were grouped together within the larger 
group N. These measures were relative dominance 
(Berger-Parker), McIntyre dominance and the Zipf- 
Mandelbrot c. The only consistent subgroup within 
evenness is three measures (fi ogit , fgeniog and pro¬ 
portionate log-normal n) that all directly measure 
the abundance of the 50th percentile species, which 
might be denoted E50. The richness group had 
three subgroups, one directly measuring richness 
and denoted S (such as the Chao estimator, ZSM 8, 
Margalef diversity, Fisher's a), one closely related 
group (sometimes intermingling in some analyses) 
on diversity (D), including the Shannon diversity, 
the Simpson and Hurlbert diversities and the Zipf- 
Mandelbrot b. The third subgroup was directly 
related to numbers of rare species (R), including log 
skewness, ZSM m, pareto c, PowerBend (3, percent¬ 
age singletons, and PctRarel% and PctRare N/S. 
Thus in summary, three groups were strongly sup¬ 
ported: the N or NC group, the E group (includ¬ 
ing E50), and the S or RSD group (Fig. 9.6). The 
subgroups (N vs C and R vs S vs D) were usu¬ 
ally found and are intuitive and therefore probably 
useful. 

Principal component analysis was also 
performed on the same data as the cluster analysis. 
Results were difficult to interpi^t and showed 
variability across analysis methods. However, 


Table 9.3 Correlations between different metrics and S (species rich¬ 
ness) and N (number of individuals) in 91 communities. Dif = |A/| + |5| as 
a measure of independence of 5 and N. 



N 

S 

Dif 

rHgenlogit 

-0.08 

-0.16 

0.24 

Zero Sum Multinimial m 

-0.14 

0.25 

0.39 

ITllogit 

-0.38 

0.03 

0.41 

PowBend Omega 

-0.26 

0.16 

0.42 

Log-Skew 

0.276 

0.16 

0.44 

Smith-Wilson Evenness 

0.43 

-0.08 

0.51 

Poisson Log-Normal /n 

-0.07 

-0.46 

0.53 

gamma shape 

-0.50 

0.05 

0.55 

Carmago Evenness 

-0.54 

-0.02 

0.56 

Poisson Log-Normal a 

0.53 

-0.03 

0.56 

^genlog 

0.20 

-0.38 

0.58 

Simpson Evenness (1/D/S) (1 

-0.43 

-0.15 

0.58 

Lognormal CV 

-0.39 

-0.21 

0.59 

% Singletons 

0.05 

0.56 

0.60 

Weibull shape 

-0.54 

0.08 

0.62 

PctRare N/S 

0.51 

0.13 

0.64 

NegBin k 

-0.25 

-0.39 

0.64 

PowBend p 

0.306 

0.34 

0.65 

RAD B (NHC Evenness) 

-0.05 

0.62 

0.67 

Lognormal 

0.326 

-0.39 

0.72 

Gambin alpha 

-0.23 

-0.50 

0.73 

Pareto c 

-0.06 

0.67 

0.73 

Log-series c 

0.40 

-0.35 

0.74 

Weibull scale 

0.42 

-0.38 

0.80 

Prop LN CV 

0.13 

-0.71 

0.84 

Lognormal a 

0.58 

-0.27 

0.85 

Proportionate Log-normal a * 

0.58 

-0.27 

0.85 

Proportionate Log-normal p.* 

-0.39 

-0.47 

0.86 

Shannon Evenness (J) 

-0.63 

0.23 

0.86 

Geometric r 

0.13 

-0.73 

0.86 

PctRare5% 

0.19 

0.69 

0.87 

Negative Binomial P 

0.88 

0.00 

0.89 

Zipf-Mandelbrot c 

0.59 

-0.31 

0.90 

•genlogit 

-0.62 

-0.30 

0.93 

Absolute Dominance 

0.92 

-0.03 

0.95 

Relative Dominance (Berger 

0.53 

-0.43 

0.96 

Hurlbert Diversity (1-PIE) 

-0.58 

0.39 

0.97 

Zipf-Mandelbrot b 

-0.32 

0.65 

0.97 

Simpson Diversity (1/D) 

-0.27 

0.70 

0.97 

McNaughton Dominance 

0.50 

-0.48 

0.99 

gamma scale 

0.82 

-0.20 

1.02 

PctRare 1% 

0.40 

0.63 

1.02 

ZSM 6 

-0.16 

0.89 

1.05 

N (# of individuals) 

1.00 

0.05 

1.05 

S (Richness) 

0.05 

1.00 

1.05 

* logit 

-0.64 

-0.43 

1.07 

Margalef Diversity 

-0.10 

0.98 

1.08 

Logseries a 

-0.15 

0.95 

1.10 

Shannon Diversity (H) 

-0.46 

0.65 

1.12 

Menhenick Diversity 

0.40 

0.75 

1.15 
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very roughly, the first axis explained about 40% 
of the variance and captured large numbers of 
intermediate abundance species vs large numbers 
of rare and common species. The second axis 
captured about 25% of the variance and could 
perhaps be interpreted as how like the geometric 
series the data were (with one end of the axis 
having a steep RAD (3, low richness and lots of rare 
species). The third axis captured about 10% of the 
variance and represented having high numbers of 
rare species vs. high numbers of common species. 


9.3.3 Overall assessment of useful, 
parsimonious metrics of SADs 

Tables 9.1 and 9.2 summarize these findings and 
add a column indicating how complex it is to calcu¬ 
late the various metrics. Weighing all of these fac¬ 
tors (unbiased and efficient, independent and con¬ 
sistent in meaning, easy to calculate), some recom¬ 
mendations emerge. The best measures depend on 
your goals. Three scenarios follow. 

• Comparing equal-sized samples from closely 
related communities: If the sample size is con¬ 
stant and the communities are directly compara¬ 
ble (e.g. herbaceous plants from a series of 10 x 
10 m quadrats) then it is meaningful to directly 
compare S and N (see Chapter 4). With these 
simple measures available, it would appear from 
the three clusters identified that a measure of 
evenness would add the most and the Shannon 
evenness or mi og it (the latter especially as sam¬ 
ple sizes get into the hundreds) would appear 
best. It is not clear that the Smith-Wilson even¬ 
ness truly captures evenness since it is clustered 
in group N and the Camargo and Simpson even¬ 
ness measures have poor convergence behaviour. 
One can even imagine doing a simplistic three- 
dimensional ordination by plotting a scatter plot 
in the three-dimensionsl space of S, N and E. If 
additional information is desired, a dominance 
(C) measure (such as relative or McNaughton 
dominance) and/or a rarity ( R ) measure such as 
log skewness (with more independence from S) or 
PctRare N/S (with good small sample behaviour) 
would also make sense. ZSM m works well but 


requires very large samples and very advanced 
computer skills. 

• Comparing unequal samples or samples from 
unrelated communities: When comparing data 
from, say, the tropics and temperate zone or data 
with different sample sizes, direct comparison of 
S and N is not meaningful. Gotelli and Colwell 
(Chapter 4) present methods to make S more com¬ 
parable. However, it may become desirable to use 
other measures from the S and N groups. Mar- 
galef's diversity or Fisher's a (the latter especially 
once sample sizes reach into the hundreds) are 
good, quickly converging representatives of the S 
group. Representing the N group, the log-series 
c or the Smith-Wilson evenness have good sam¬ 
pling behaviours but the latter can be confused 
with an evenness measure and should proba¬ 
bly be avoided. The remaining recommendations 
would be the same as the previous section (Shan¬ 
non evenness or m\ osit to capture evenness, E, and 
then, if desired, a dominance measure, C, and a 
rarity measure, R). 

• Very small sample sizes (< 100s): When less 
than a few hundred individuals are sampled, the 
gross inaccuracy of most metrics at representing 
the larger community must dominate goals of 
independence. In this case, the log-series c, the 
Hurlbert/Simpson and Shennon diversity, rela¬ 
tive or McNaughton dominance, and PctRare5% 
and PctRareN/S and possibly Shannon evenness 
(less rapidly convergent than the others, but the 
best of the £ cluster) make the most sense. 

9.4 Prospectus 

There can be little doubt that sophisticated multi¬ 
variate analyses are perhaps the most rigorous way 
of analyzing abundance data. However, the long¬ 
standing popularity of simpler (visual and metric- 
based) approaches to SADs suggests that not every¬ 
body can or will make the switch, therefore it is 
important to pursue a more full understanding of 
the simpler approaches as well. In this pursuit, 
a balance is needed between adding new mea¬ 
sures that might be better and keeping the num¬ 
ber of measures under control. Ecology has tended 
to see new methods added without old methods 
decisively removed. Science does not progress this 
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way (Platt 1964). I have attempted here to suggest 
some new approaches (e.g. m i 0 g it and direct mea¬ 
sures of rare species by metrics like PctRare5%) 
as well as to perform some pruning. The results 
are fairly messy. More efforts are needed in both 
directions. 

In addition, the results have suggested some tan¬ 
talizing patterns worth pursuing that might ulti¬ 
mately lead us to a more mechanistic analysis. 
The appearance that Smith-Wilson evenness (called 
Evar by them) does not align with other evenness 
measures is a surprise and requires more analysis. 
The fact that the Zipf-Mandelbrot distribution con¬ 
verges quickly and the two parameters are indepen¬ 
dent suggests that this probability model deserves 
more usage and understanding. The seeming links 
between richness (S) and rarity ( R ) and between 
abundance/sample size (N) and dominance (C) 
are provocative. These may prove to be mere arte¬ 
facts of the limited number of ways one can pack 
individuals into species, but it is not immediately 
obvious that they are. Finally, although the results 
of principal component analysis were very weak, 
the singling out of the importance of the num¬ 
ber of intermediate abundance species is sugges¬ 
tive. Gray noted that intermediate abundant species 
were the ones that responded most to environmen¬ 
tal change (Gray 1979), and Kempton (1979) sug¬ 
gested that metrics focused on species of interme¬ 
diate abundance had the greatest ability to discrim¬ 
inate between sites. As yet, few metrics have been 
targeted at this area. 

9.5 Key points 

1. Species abundance distributions have been 
widely used because they are easy to measure 
and have simple methods of analysis (visual and 
various metrics). 

2. I suggest that using a rescaled ECDF (empirical 
cumulative distribution function) is the optimal 
way to plot SADs. 


3. Most SAD metrics are very inefficient and usu¬ 
ally biased. Very large sample sizes are needed. 
In general, field workers should target sampling 
1000 individuals and if this goal is not met, only 
a few metrics are appropriate. 

4. The metrics proposed to date seem to span 
three broad features, richness/diversity, even¬ 
ness, and abundance. These three broad groups 
have some reasonably consistent subgroups, 
suggesting a total of seven different groupings 
of metrics: richness (S), diversity (D), rarity 
(R), evenness (E), with a subgroup focused on 
median abundance (E50), and abundance (N), 
with a subgroup focused on relative dominance 
of common species (C). 

5. Three specific scenarios with regards to different 
types of data are described. All of these scenarios 
can be analysed visually using the ECDF. When 
using metrics, different sets of metrics are recom¬ 
mended for the three different scenarios. 
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CHAPTER 10 


Fitting and empirical evaluation 
of models for species abundance 
distributions 

Sean R. Connolly and Maria Dornelas 


10.1 Introduction 

Identifying and explaining patterns in the common¬ 
ness and rarity of species has been a fundamental 
concern of community ecology for nearly a cen¬ 
tury (Chapter 8). Such species abundance distributions, 
when considered without reference to species iden¬ 
tity are particularly useful to ecologists for two 
reasons. Firstly, all assemblages have abundance 
distributions to compare with one another and, sec¬ 
ondly, those distributions contain more informa¬ 
tion than univariate statistics such as species rich¬ 
ness or other diversity metrics (McGill et al. 2007). 
Consequently, ecologists frequently analyse abun¬ 
dance distributions to identify systematic regulari¬ 
ties that hold across disparate assemblages and to 
test ecological theory that purports to explain such 
regularities. 

Nearly a century of investigation has iden¬ 
tified both broad-scale similarities and system¬ 
atic differences in patterns of species abundance. 
Assemblages with more than a few species over¬ 
whelmingly have a frequency distribution of 
species' abundances that exhibits a 'hollow curve': 
a plurality of species are represented by one or 
a few individuals; the number of species repre¬ 
sented by increasing numbers of individuals drops 
off sharply as abundance increases; and there is a 
very long tail of moderately to highly abundant 
species (Fig. 10.1a). However, when abundance is 
plotted on a logarithmic scale, visually identifi¬ 
able qualitative differences emerge: for instance, 
in some assemblages, one or more internal modes 
are present, but in others, no mode is apparent 


(Fig. 10.1b). The extent to which these differences 
represent stochastic variation in the numbers of 
species in each abundance class, systematic distor¬ 
tion of abundance distributions by sampling effects, 
or differences in the abundance distributions of the 
underlying communities is controversial (Hubbell 
2001; Lande et al. 2003). 

Since Motomura (1932; in Whittaker (1965)) pro¬ 
posed the geometric series to characterize pat¬ 
terns of relative abundance in benthic lake habi¬ 
tats, numerous community models have been used 
to predict species abundance distributions. Some 
of these models purport to link abundance dis¬ 
tributions to a broad range of other ecological 
quantities, such as numbers of unobserved species 
(Connolly et al. 2005), range-size frequency distri¬ 
butions (McGill & Collins 2003), species-area and 
species-time relationships (Adler 2004), body size 
distributions (Loehle 2006), spatial and temporal 
patterns in relative abundance (McGill et al. 2005; 
Dornelas et al. 2006), and niche similarity (Sugihara 
et al. 2003). Many such models predict some of these 
additional patterns, but not others, so species abun¬ 
dance distributions have become the data most 
commonly used to compare the performance of 
alternative community models. Using such data 
to compare models has been criticized recently 
because many attempts to assess model fit to 
species abundance distributions fail to identify sig¬ 
nificant discrepancies between models and data, or 
to clearly discriminate between the fit of alternative 
models. However, recent developments in model 
formulation, model fitting, goodness-of-fit testing 
and model selection have substantially increased 
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(a) (b) 



Figure 10.1 Species abundance distributions for the British Breeding Bird survey (Gaston & Blackburn (2000): solid lines) and the benthic fauna of the 
Ekofisk Oil Field (Gray et al. (1990): dashed lines) on (a) an arithmetic scale and (b) a log 2 scale. Note the strong qualitative similarities between the data 
sets on an arithmetic scale and the marked differences on the logarithmic scale. 


our ability to identify and interpret lack of model 
fit and to compare the fit of alternative models. 

In this chapter, we review and assess traditional 
and recent techniques for fitting, and for evaluating 
the fit of, species abundance models. We discuss 
different approaches for fitting species abundance 
models to data, for assessing the models' (absolute) 
goodness of fit to those data and for comparing the 
(relative) fit of alternative models. Because there has 
been an increasing diversity of approaches in each 
of these areas, we describe and critically evaluate 
these different approaches. We conclude with some 
advice about the choice of approaches to fitting and 
testing species abundance models, highlight some 
important areas for further work, and offer some 
cautionary words about the ecological interpreta¬ 
tion of the results of such analyses. 

This chapter focuses on the analysis of numeri¬ 
cal abundance (i.e. counts of individuals) because 
these data have been most intensively investigated 
and because much of the ecological theory and the 
more recent statistical modelling is tailored to such 
data. However, other abundance currencies, such 
as percentage cover, biomass and energy, have also 
been analysed (Chiarucci et al. 1999; Connolly et al. 
2005). In particular, some theory for abundance dis¬ 
tributions, such as niche-apportionment theory, is 
probably better suited to such alternative metrics 
(Tokeshi 1990). Recently, Morion et al. (2009) 


proposed a general statistical framework for iden¬ 
tifying causal relationships among different abun¬ 
dance metrics, and O'Dwyer et al. (2009) derived 
the relationships that arise for a neutral model with 
size-dependent demography. We will return to this 
topic briefly at the end of the chapter. 

10.2 State of the field 

10.2.1 Species abundance models 

Species abundance models may be theoretical (i.e. 
derived explicitly from assumptions about the bio¬ 
logical factors that generate variability in abun¬ 
dances) or phenomenological (i.e. chosen because 
they appear to resemble empirical distributions). 
Stochastic abundance models, such as neutral mod¬ 
els (Bell 2000; Hubbell 2001) and the environmental 
stochasticity models of Engen and Lande (1996a,b) 
are examples of the former, while the logit-normal 
distribution (Williamson & Gaston 2005) is an 
example of the latter. However, most models incor¬ 
porate both theoretical and phenomenological ele¬ 
ments. For instance, niche apportionment models 
are derived by assuming that a species' abundance 
is proportional to a randomly determined share of 
resources along one or more niche axes (Tokeshi 
1990). Such models reflect the biological reason¬ 
ing that species differences in access to resources 
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are the principal drivers of species abundances, 
and reflect ecological concepts such as pre-emption 
of niche space. However, a species' share of the 
available resource pool is either assumed fixed 
or chosen randomly from a uniform distribution, 
even though (to our knowledge) a biological argu¬ 
ment for these particular values or distributions 
has never been advanced in the literature. Similarly, 
Engen & Lande (1996b) show how a log-normal 
distribution of species abundances can arise in a 
stochastic model of community dynamics, but ele¬ 
ments of the model (e.g. a log-normal distribution 
of intrinsic growth rates among species) appear to 
have been chosen more for mathematical tractabil- 
ity than on biological grounds. 

In addition to reflecting the processes that gen¬ 
erate the underlying distribution of species abun¬ 
dances in the community, abundance distributions 
in ecological data sets also reflect the processes by 
which the data are sampled from the community. 
Some of the earliest attempts to fit species abun¬ 
dance models to data explicitly considered both 
of these factors. For instance, Preston (1948) pro¬ 
posed that incomplete sampling would produce a 
truncated species abundance distribution. Subse¬ 
quently, the role of incomplete sampling was for¬ 
malised by mathematical models (e.g. Pielou 1975). 
Such models typically are compound distributions 
that account for a species' abundance in the com¬ 
munity, and in the sample: 

Pr(r, n) = Pr(r|n) Pr(n) (10.1a) 

Pr(r) = X> (r, n) (10.1b) 

n 

Equation 10.1a follows from the definition of condi¬ 
tional probability. The first term is the probability 
that a species has abundance r, given that its true 
abundance is n; this quantity depends on stochastic 
sampling effects that cause observed relative abun¬ 
dance in samples to vary around the species' true 
relative abundance in the community. The second 
term is the probability that a species' true abun¬ 
dance is n; this depends on the shape of the abun¬ 
dance distribution in the underlying community, 
and thus on the biological processes that determine 
that shape. The overall probability that a species 
has abundance r in a sample (equation 10.1b) is the 
sum (or integral, if true abundance is a continu¬ 


ous variable) of equation 10.1a over all possible true 
abundance values, n. 

Models of the form given by equation 10.1 reveal 
that incomplete sampling can cause sample abun¬ 
dance distributions to differ markedly from the 
underlying community abundance distributions. 
These effects are somewhat more complicated than 
simple truncations of the true distribution, as pro¬ 
posed by Preston (1948), but they often qualitatively 
resemble such truncated distributions (Fig. 10.2). 
Most such models assume that the data are a 
random sample from the larger community (e.g. 
Pr(r|n) follows a Poisson distribution with a mean 
proportional to n), but some explicitly consider 
the effect of non-random sampling of species' 
abundances. That is, encounter rates for a species 
during ecological sampling may not be propor¬ 
tional to their true abundances in the commu¬ 
nity if species differ in detectability or in spatial 
distribution relative to the sampling scale (Chap¬ 
ters). For instance, Engen et al. (2002) examine 
changes in the shape of species abundance dis¬ 
tributions when such non-randomness follows a 
Poisson log-normal or negative binomial distribu¬ 
tion, rather than a Poisson distribution (also see 
Green and Plotkin (2007)). 


10.2.2 Obtaining predicted abundances 

Fitting a species abundance model to a sample of 
species abundances requires choosing a set of para¬ 
meter values from their ranges of possible values. 
This requires a method for generating model pre¬ 
dictions for each set of parameter values to be con¬ 
sidered, and a statistic that quantifies the model's 
agreement with the data for those parameter values. 
For many species abundance models, predictions 
can be generated exactly, by plugging parameter 
values into a closed-form expression. For example, 
for a random sample from a gamma distribution 
of species abundances (Pielou 1975), or as a result 
of demographic stochasticity with a constant rate 
of immigration (Volkov et al. 2007), the probability 
that a species has abundance r follows a negative 
binomial distribution: 


r(r + k) m k 

rim (1 + m) r+k 


( 10 . 2 ) 
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Figure 10.2 Illustration of veil-like effect of incomplete sampling on species abundance patterns. Dashed lines show (a) log-normal and (b) gamma 
relative abundances for an underlying community; the shaded lines show the (a) Poisson log-normal and (b) negative binomial species abundance pattern 
expected when sampling from these communities at different levels of intensity. Following Lande et al. (2003), each curve has been rescaled and shifted 
along the horizontal axis to highlight that, while the pattern resembles qualitatively an 'unveiling' of the underlying abundance distribution, the curves do 
not superimpose perfectly. 


where m and k are the scale and shape parameters of 
the underlying gamma distribution. 

In other cases, a mathematical expression for a 
model prediction exists, but a numerical approx¬ 
imation is required to generate a prediction. For 
instance, the Poisson log-normal distribution arises 
from Poisson sampling of individuals from a log¬ 
normal distribution of species abundances: 

OO 

exp(-A) A r " 

_ 7 \ 

A=0 



(10.3) 


where ln(m) and a are the mean and standard devi¬ 
ation of the natural logarithm of abundance, respec¬ 
tively. The first square brackets in the top line of 
equation 10.3 say that the probability that a species 
has abundance r in the sample is Poisson with rate 
parameter A, and the second square brackets say 
that the rate parameter A, which is proportional 
to relative abundance in the community, follows a 
log-normal distribution. In this case, the integral 
cannot be solved explicitly, so numerical methods 
for approximating its value must be applied. In 
some cases, the numerical calculations can be done 
(with the aid of a computer) almost as quickly as 


evaluating equation 10.2. In other cases, however, 
the necessary calculations are more cumbersome 
and can substantially slow down the process of 
parameter estimation because they may need to be 
made repeatedly for a large number of different 
combinations of parameter values. 

Some models require stochastic simulation to 
generate model predictions. Most niche apportion¬ 
ment models fall into this category, as do some sto¬ 
chastic abundance models that do not have analyt¬ 
ical solutions (e.g. Bell 2001). This greatly increases 
the amount of computational time required to gen¬ 
erate model predictions. Because each simulated 
model outcome is subject to stochastic variation, 
a large number of simulations must be conducted 
and generally averaged in some way to produce the 
model's prediction. However, even this averaging 
process does not completely eliminate stochastic 
error in model predictions. When the model has 
parameters that are estimated from data, this com¬ 
plicates the process of finding the parameter values 
that yield the best fit to the data, as we explain 
below. 


10.2.3 Choosing parameters 

Determining how well a model can explain pat¬ 
terns in empirical data often requires that parame¬ 
ter values be chosen from a range of possible values, 
but this is not always the case. For instance, under 
the dominance pre-emption species abundance 
model, the proportion of the available resource pool 
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monopolized by one species is drawn from a uni¬ 
form distribution: all proportions between 0.5 and 1 
are equally likely (Tokeshi 1990). A second species 
then consumes a similarly determined proportion 
of the remaining resources, and so forth. There is no 
parameter that can be varied to make some resource 
proportions more likely than others. 

For models that do have parameters, values for 
those parameters generally are estimated directly 
from the data being compared with the model. 
Several different approaches have been used to 
estimate parameters, which we review below. 
Regardless of the particular method, however, such 
estimates are based on some statistic that quantifies 
the discrepancies between data and model predic¬ 
tions for a particular set of parameter values. The 
values of model parameters are then changed until 
the value of this statistic is minimized. When a 
model has only one or two parameters, best-fit val¬ 
ues can usually be approximated by brute force: cal¬ 
culating the model fit statistic for a range of parame¬ 
ter values and finding the statistic's minimum value 
(e.g. Fig. 10.3a). However, when calculating model 
predictions or model fit statistics is computationally 


demanding, or when many parameters must be 
estimated, this approach can be impractical; there¬ 
fore, contemporary applications typically rely on 
optimization algorithms to find efficiently the max¬ 
ima or minima of functions. These algorithms start 
from an initial set of parameter values and change 
those values systematically until further changes 
cease to improve the fit. This is why generating 
models by stochastic simulation can cause difficul¬ 
ties in parameter estimation: the model fit statis¬ 
tic will be subject to small ups and downs due to 
stochastic simulation error, and this can cause opti¬ 
mization algorithms to fail (Fig. 10.3b). There is no 
readily available fix for this problem: researchers 
must check that any parameter estimates obtained 
genuinely do give the best fit, for instance by plot¬ 
ting goodness-of-fit profiles (Hilborn & Mangel 
1997). Failure to find the best fit can also occur 
when there are multiple 'local' peaks in model fit, 
meaning any small movement away from a com¬ 
bination of parameter values causes a reduction in 
model fit, even though a better fit is possible for a 
very different set of parameter values (see Etienne 
et al. (2006) for an example in species abundance 


(a) 


(b) 




Figure 10.3 (a) A surface plot of the negative sum of squares for the Poisson log-normal, fitted to rank abundance data simulated from a Poisson 
log-normal distribution with m = 1, a = 1.5 and 5 = 200. (b) A surface plot of the same model fitted to the same simulated data, but with the expected 
rank abundance distribution for each set of parameter values generated by averaging the rank abundance distributions for 1000 simulated data sets using 
those parameter values. 
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analysis). There is no foolproof way to avoid this, 
but it is good practice to re-start optimization algo¬ 
rithms from a range of different initial parameter 
values, to increase the likelihood that all of the local 
peaks in the model fit are found. 

We now turn to several common statistics used in 
parameter estimation. 

Maximum likelihood for species abundances 
Contemporary studies frequently fit species abun¬ 
dance models by maximum likelihood methods. 
This requires deriving, or numerically approximat¬ 
ing, the probability that a particular set of species 
abundance values will be observed, given specific 
values of model parameters. For instance, equa¬ 
tion 10.2 gives the probability that a species will 
have abundance r in a sample, given that species 
abundances follow a negative binomial distribution 
with parameters m and k. The support that an obser¬ 
vation provides for a particular set of parameter 
values, called the likelihood, is proportional to this 
probability: 

£(0 | r) oc Pr( r | 0) (10.4) 

where 6 is a vector of all of the model's parameter 
values (e.g. 8 - [m,k] for the negative binomial) and 
r is an observed species abundance. Note that for a 
species to appear in the likelihood, it must appear 
in the sample. If one is fitting a model in which 
some species in the underlying community may 
not appear in the sample, such as the Poisson log¬ 
normal or negative binomial, then the zero-truncated 
form of the model accounts for this: 

Pr(r | 0) = —^—, (10.5) 

r — p o 

where p r is the probability that a species has abun¬ 
dance r (e.g. as in equations 10.2 and 10.3). Thus, the 
denominator is the probability that the species does 
not have abundance zero, and Pr(r|0) is the proba¬ 
bility that a species has abundance r, given that it 
has appeared in the sample at least once. Most com¬ 
monly, equation 10.5 is extended to account for the 
entire sample by making the simplifying assump¬ 
tion that species' abundances are statistically inde¬ 
pendent of one another, and thus the likelihoods for 
the individual species are multiplied together: 


c I fote 

£(0|r) = (m6) 

n 4>n\ 5=1 

11=1 

Here, S 0 \, s is the total number of species in the 
data set, N is the total number of individuals sam¬ 
pled and is the number of species with abun¬ 
dance n. The fraction is a normalizing constant to 
account for the number of different ways that the 
observed species abundances can be divided up 
among the S 0 bs species sampled (usually ignored, 
because for any given sample of species abun¬ 
dances, it is independent of the model or parameter 
values). 

In reality, species abundances will not be statis¬ 
tically independent of one another. For instance, 
in sampling a given area for sessile organisms, if 
one species is particularly abundant, then there is 
less space available for another species. One way 
to account for such constraints is to condition on 
the total number of individuals that are sampled. 
In other words, given that one has observed a par¬ 
ticular set of species abundances that add up to a 
total sample size of N, what is the likelihood for 
a particular set of parameter values? This requires 
normalizing (i.e. dividing) by the probability that 
the species abundances add up to the observed 
sample size. Calculating this quantity can be very 
time-consuming because it requires calculating the 
likelihoods for all of the possible combinations of 
S 0 b s species abundances that give a total sample 
size of N, and adding them up (Etienne & Olff 
2004). 

Because accounting for such conditioning is 
potentially cumbersome, it is worth evaluating 
the robustness of parameter estimates obtained, 
using the simpler likelihood (equation 10.6), to vio¬ 
lation of this independence assumption. To explore 
this, we compared the bias of parameter estimates 
obtained using this likelihood when data are sim¬ 
ulated in two different ways: in the Poisson algo¬ 
rithm, individuals (and thus species) are sampled 
independently of one another, consistent with equa¬ 
tion 10.6; in the hypergeometric algorithm, individ¬ 
uals are sampled from the community only until 
a fixed total sample size is reached (see Box 10.1 
for details). We found no differences in the bias of 
parameter estimates for the Poisson log-normal or 
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negative binomial models (Fig. 10.4): Wilcoxon tests 
for paired observations confirm no significant dif¬ 
ferences in bias for any parameter values, for either 
model (P > 0.4 in every case). 


Box 10.1 Analysis of parameter bias 

due to fixed sample size 

For the Poisson log-normal distribution, we first 
simulated an underlying distribution of species 
abundances in the community for a particular total 
number of species, S true , and with a particular standard 
deviation of log abundance, cr. For the hypergeometric 
algorithm, we sampled N individuals from this 
community abundance distribution, where the 
probability that an individual belonged to species i, p„ 
was equal to the species' relative abundance in the 
community. We then fit the Poisson log-normal 
distribution to the resulting simulated sample abundance 
distribution, using the likelihood in equation 10.6. For 
the Poisson algorithm, for each species in the underlying 
community, we generated its numerical abundance in 
the sample by choosing a random number from a 
Poisson distribution with mean p,/V; thus, total sample 
size was a Poisson random variable with mean N. Note 
that, in both algorithms, the number of observed species 
in the sample can be (and generally is) less than 5 true 
because some species have abundance zero. For each 
bootstrap algorithm, we repeated this simulation 1000 
times for each of 27 combinations of parameter values 
and sample sizes: ct = [1,2,3], N = [100,1000,10000] 
and S, tue = [20,200,2000]. We calculated the mean 
standardized bias as the difference between the mean 
parameter estimate (across the 1000 simulations for a 
particular parameter combination) and the true value, 
divided by the standard deviation of parameter estimates 
among those simulations. 

For the negative binomial, we followed the same 
procedure, except that the underlying community 
abundance distribution was gamma, rather than 
log-normal, and the parameter and sample size 
combinations were k = [0.01, 0.1, 1 ],/V = [1000, 
10000, 100000] and 5 true = [20,200,2000], The larger 
values of N are required for this model to ensure that 
sufficiently many species appear in the sample for 
adequate converge of parameter estimation algorithms. 

See Connolly et al. (2009) for a detailed description of 
all algorithms. 


Poisson log-normal 

III (7 





Standardized bias (poisson) 

Figure 10.4 Comparison of the standardized bias of parameter values 
for data simulated according to the hypergeometric (fixed sample size) and 
Poisson (variable sample size) bootstrap algorithms. Each point represents 
the mean standardized bias for one of 27 parameter combinations for 
(a, b) the Poisson log-normal and (c, d) the negative binomial. See 
Box 10.1 for a detailed description of the bootstrap simulations. The large 
standardized biases (i.e. magnitudes > 1, lower left of panel b) 
correspond to cases with a = 3 and sample sizes smaller than total species 
richness: in these cases, the proportion of the total number of species that 
actually appeared in the simulated sample was very small. 

These analyses suggest that failing to condition 
on sample size is unlikely to substantially affect the 
biases in parameter estimates, at least for these two 
models. Moreover, it illustrates how this problem 
can be examined for other models. 


Maximum likelihood for abundance classes 
Several analyses first place species' abundances into 
abundance classes and then use maximum likeli¬ 
hood methods to fit model predictions of the num¬ 
ber of species in each abundance class against the 
corresponding observed values (e.g. Hubbell 2001). 
In general, this approach is likely to be inferior 
to using the actual abundance values because it 
removes information from the data that can provide 
important information about model fit. Because 
such abundance classes are usually logarithmic in 
nature (i.e. they get wider as abundance increases), 
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this loss of information will be particularly pro¬ 
nounced for the most abundant species. This is 
potentially quite important because the thickness 
of the tail of highly abundant species is one of the 
major differences between species abundance mod¬ 
els. For these reasons, fitting models to unbinned 
data should tend to lead to narrower confidence 
limits on model parameters (and for similar reasons 
should lead to stronger goodness-of-fit tests and 
model selection results). However, we have no par¬ 
ticular reason to suspect that parameter estimates 
would tend to be biased when data are binned. 
Consequently, we suspect that the use of abun¬ 
dance classes probably entails weaker statistical 
inferences, rather than erroneous inferences. 

There is one circumstance in which use of abun¬ 
dance classes for model fitting may be preferable 
and that is when model predictions must be gen¬ 
erated by stochastic simulation (e.g. Hubbell 2001). 
To calculate likelihood, each observed species abun¬ 
dance must have a corresponding model predic¬ 
tion for the probability that a species has that 
abundance. When model predictions are generated 
by simulation, such probabilities must be approx¬ 
imated by running a large number of simulations 
for a particular set of parameter values and then 
calculating the average proportion of species that 
ended up with that abundance in those simulations. 
However, when the true probability that a species 
will end up with a particular abundance value is 
very low, stochastic variation among ensembles of 
simulations can have a substantial effect on para¬ 
meter estimation. For instance, substantially dif¬ 
ferent likelihoods may be obtained, depending on 
whether one, two or three out of 1000 simulations 
produce a species with a particular species abun¬ 
dance that is observed in the data. Making predic¬ 
tions about abundance classes instead of specific 
abundance values will tend to make the outcomes 
of simulations more consistent with one another 
across replicate runs and therefore stabilize parame¬ 
ter estimates. 

Maximum likelihood for rank abundances 
Rank abundance distributions differ from normal 
species abundance distributions by associating a 
rank with each species' abundance (i.e. the most 
abundant species has rank 1, next most abundant 


has rank 2, etc.). This introduces an additional layer 
of statistical complexity on top of those relevant to 
unranked abundance distributions: a species with 
abundance rank j cannot be more abundant than 
the one with rank j - 1, and cannot be less abun¬ 
dant than the species with rank j +1. We know 
of only one maximum likelihood approach to rank 
abundance distributions (Foster & Dunstan 2009). 
This approach conditions on the total number of 
species, the total sample size and the abundances 
of all of the species with higher rank: 

Pr(n ; )=Pr(N) Pr(S]N) Pr(«i|S, N) Pr(n 2 |S, N, «i) 
... Pr(n s |S, N, «i, n 2 ,..., «s-i). (10.7) 

In cases where the probability distribution of abun¬ 
dances, for a species with a particular abundance 
rank, has a simpler form than the full species 
abundance distribution itself, this may be a more 
tractable approach to parameter estimation than 
one based on fitting to the frequency distribu¬ 
tion of species abundances. For instance, under 
the geometric series model, the abundance of a 
species is drawn from a probability distribution 
that depends on the abundances of all species with 
higher abundances, but not on species with lower 
abundances. Such models lend themselves natu¬ 
rally to approaches like this one. However, for 
many models (e.g. the Poisson log-normal, as far as 
we are aware), no likelihood of the form of equa¬ 
tion 10.7 has been formulated for the rank abun¬ 
dance distribution. 

Least-squares approaches 

Least-squares approaches are often used as an alter¬ 
native to maximum likelihood in the literature. 
Most commonly, these methods are applied to rank 
abundances by finding the parameter values that 
minimize 

X («/ - «/) 2 ’ ( 10 - 8 ) 

M 

where n t is the observed abundance of the species 
with rank j, and n) is the predicted abundance of 
a species with abundance rank j (Loehle & Hansen 
2005; Woodcock et al. 2007). However, least-squares 
methods have also been used to fit species abun- 
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dance data categorized into abundance classes 
(Gray et al. 2005; Pueyo 2006; Volkov et al. 2007). 

There are reasons to treat least-squares fitting 
of species abundance data with considerable cau¬ 
tion, particularly when applied to rank abundance 
data. The statistical theory that supports least- 
squares estimation holds if three conditions are met: 
observed values are statistically independent of one 
another, the variance of this distribution is the same 
for all predicted values, and the residual variation 
has a mean of zero. For rank abundance data, the 
first two of these assumptions are certainly violated, 
often quite severely, and there is no particular rea¬ 
son to expect the third assumption to be met either. 
Specifically, every rank abundance is constrained 
to be no larger than the abundance of the next 
more highly ranked species, and no smaller than the 
next lower-ranked species. These constraints make 
rank abundances highly non-independent. More¬ 
over, the width of this range of possible abundance 
values tends to narrow substantially from the most 
abundant to the least abundant species, so the resid¬ 
ual variation in abundances will also change sub¬ 
stantially as a function of rank. For instance, many 
species abundance distributions have a large num¬ 
ber of 'singleton' species (species represented by 
only one individual) in an assemblage. In such com¬ 
munities, the lowest-ranked species will be guaran¬ 
teed to have an abundance of exactly one individ¬ 
ual because the next-higher ranked species will also 
have an abundance of one. 

For least-squares fits to abundance classes, the 
situation is somewhat different. The 'observed' and 
'predicted' values are not the abundances of species 
with particular ranks, but rather the numbers of 
species falling in a particular abundance class. For 
this latter case, any statistical non-independence 
among species is likely to be weaker. Moreover, 
under random sampling, the stochastic variability 
around the number of species in an abundance 
class should approximately follow a Poisson distri¬ 
bution. Provided the predicted number of species 
in each abundance class is large enough (around 
five species or more), this Poisson variability should 
closely approximate a normal distribution. This 
does not resolve the problem of unequal vari¬ 
ances, which violates an assumption of the least- 
squares approach. However, because the variance 


and mean are equal for Poisson random variables, 
the residual variance can be homogenized by divid¬ 
ing each residual by the corresponding expected 
mean value. This approach is implicit in approaches 
that minimize Pearson's X 2 statistic (e.g. Doroghazi 
& Buckley 2008): 

x2 = ^ (0, -E,) 2 , (m9) 

where O, and £, are the observed and expected 
numbers of species in abundance class i, respec¬ 
tively. Thus, parameter estimation by minimizing 
X 2 should have similar properties to maximum like¬ 
lihood fitting to abundance classes, provided that 
expected values are not too small. 

Separately estimated parameters 
Although most species abundance analyses esti¬ 
mate parameters by finding the values that maxi¬ 
mize model fit, the parameters of some models can 
be estimated independently. Generally, this requires 
that the parameter represent a directly measurable 
biological quantity. For example, most neutral mod¬ 
els explicitly include a parameter to characterize 
the proportion of new recruits in a local commu¬ 
nity that arrive from elsewhere (e.g. the migration 
parameter m; Hubbell (2001)). If there is empirical 
information about dispersal in a group of species 
(e.g. the shape of the dispersal kernel), then m can 
be estimated based on this information (Etienne 
2005). Because this approach constrains the possible 
shapes that a model can take, it makes good agree¬ 
ment between a model and data less likely to occur 
by chance, and thus more likely to indicate that 
the model does, in fact, approximate the ecologi¬ 
cal processes that gave rise to the data. The closest 
example of this of which we are aware is a study of 
intertidal dynamics, in which mortality and immi¬ 
gration probabilities were estimated from observed 
transitions in space occupancy from experimental 
plots, then used to predict the species abundance 
distribution from independent transects at the same 
site (Wootton 2005). 

10.2.4 Goodness-of-fit testing 

Goodness-of-fit testing involves determining 
whether a model, once it has been fitted to data. 
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adequately captures the patterns in those data. 
Often, in species abundance analysis, such 
assessments are made subjectively, based (for 
instance) on the apparent visual concordance 
between observed and predicted values, or on 
the percentage of variance explained by a model. 
Formal goodness-of-fit tests determine whether the 
model's fit to the data is significantly worse than 
one would expect if the model were true, and, if so, 
how much worse. In species abundance analysis, 
such tests have included several conventional tests 
for frequency distributions (e.g. Kolmogorov- 
Smirnov, Pearson's X 2 or G-tests). More recently, 
however, computationally intensive methods have 
been applied that appear to be better at identifying 
lack of model fit. The last few years have also 
seen techniques for unpacking information about 
model fit to determine how model predictions 
depart systematically from empirical data. Finally, 
although not goodness-of-fit tests sensu stricto, 
model fit also can be evaluated by making 
predictions about patterns other than those used in 
the fitting procedure. This latter approach is a way 
of moving beyond a model's ability to describe the 
data to gain insight into whether the processes that 
generate the patterns in the data are consistent with 
those included in the model. 

Goodness-of-fit tests are null hypothesis tests: 
they determine whether or not one can reject the 
hypothesis that the data were actually generated by 
the specified model. However, there is a growing 
recognition among ecologists that models are, by 
definition, idealizations of nature and therefore 
that all models can be rejected, given sufficient 
data and a powerful enough goodness-of-fit test. 
This recognition has been partly responsible for 
sparking interest in model selection statistics, 
which aim to compare model fit relative to other 
models, rather than relative to the possibility that 
the model is somehow 'true', as in goodness-of-fit 
tests. However, the way in which a model fails to 
fit empirical data can sometimes suggest particular 
processes, omitted from the model, that may be 
important in nature (e.g. Dornelas et al. 2006). 
In addition, some model selection statistics are 
invalid if none of the alternative models provide a 
good approximation to the data. Finally, a model's 
utility for predicting unobserved data depends 


critically on how well it characterizes the true 
underlying distribution. Consequently, the analysis 
of goodness of fit remains important in the study 
of species distributions, but there is an emerging 
shift away from statistical significance per se, and 
towards quantifying the magnitude of lack of fit 
and pinpointing the particular features of the data 
that are poorly approximated by a model. 

Subjective methods 

Graphical assessment of model fit goes back to 
the earliest analyses of species abundance distri¬ 
butions (e.g. Fisher et al. 1943; Preston 1948) and 
is still widespread in the species abundance liter¬ 
ature (Ford & Lancaster 2007; Engen et al. 2008). 
Visual assessments are important for identifying 
what it is about the data that is responsible for 
their lack of agreement with a model, but they 
can be uninformative or even misleading when 
used in isolation, as can other ad hoc methods 
such as inspecting r 2 values. The predictions of a 
species abundance model, except as an extraordi¬ 
nary coincidence, will never match an empirical 
pattern exactly. Such discrepancies may be due to 
stochastic effects that are consistent with the model. 
For instance, a model may predict that the most 
abundant species includes 60% of the individu¬ 
als in a community, and this may, in fact, be true 
of the actual community, but the species may be 
slightly over- or under-represented in the sample 
being analysed. Alternatively, discrepancies may be 
due to the fact that the model poorly approximates 
the 'true' species abundance distribution. Subjec¬ 
tive methods cannot be used to distinguish rigor¬ 
ously between these two possibilities. 

Asymptotic tests 

Goodness-of-fit tests compare a model fit statistic 
with a null distribution. If the data are consistent 
with the model, the model fit statistic should be 
a random draw from that distribution. Statistical 
theory has been used extensively to identify statis¬ 
tics whose null distributions have the same basic 
shape regardless of details of the model, such as 
parameter values. Several such test statistics have 
been used in the analysis of species abundance data. 
For instance, the log-normal and logit-normal dis¬ 
tributions have been tested by conducting standard 
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normality tests on transformed abundances (Con¬ 
nolly et al. 2005; Williamson & Gaston 2005); how¬ 
ever, this approach is useful only for distributions 
that can be made Gaussian by transformation. More 
general tests involve the use of contingency tables 
comparing observed and predicted frequencies of 
species in different abundance classes via Pearson's 
X 2 or similar statistics (Yin et al. 2005; Caruso & 
Migliorini 2006; Pueyo 2006; Sitran et al. 2009), for 
which the null distribution is chi-squared. Other 
analyses have used the Kolmogorov-Smirnov test, 
but this test is very weak when model parameters 
are estimated from data, as in most species abun¬ 
dance analyses (Zar 1996). 

Parametric bootstrapping 

Asymptotic tests are based on mathematical 
approximations of what the distribution of a par¬ 
ticular statistic should be, given particular assump¬ 
tions about the data. Often, these simplifying 
assumptions are not well met. For example, the chi- 
squared null distribution is appropriate only when 
there are minimum frequencies in each abundance 
category, so, in practice, species must be placed in 
abundance classes that often span a large range of 
abundance values. This removes information from 
the data and can lead to tests with low statistical 
power. An alternative approach is to estimate the 
null distribution directly, by simulating data sets 
that accord with the assumptions of a particular 
fitted model and then computing a model fit sta¬ 
tistic for each simulation. The frequency distribu¬ 
tion of this test statistic across simulations is then 
used as the null distribution for the model's fit 
to the empirical data. Such approaches have been 
termed 'parametric bootstrapping' because one re¬ 
samples data from a fitted model to estimate the 
uncertainty distribution of a statistic, rather than 
re-sampling from the data itself (Efron & Tibshi- 
rani 1993). The disadvantage of this approach is 
that this null distribution is sample and model 
specific, and so must be conducted anew for each 
model and data set under investigation. However, 
advances in computing power are making such 
analyses increasingly practical, and its versatility 
makes it a compelling alternative to more tradi¬ 
tional approaches. In biology, for instance, para¬ 
metric bootstrapping is becoming commonplace in 


capture-mark-recapture analysis (White et al. 2001) 
and phylogenetics (Holmes 2003). 

In the analysis of species abundances, paramet¬ 
ric bootstrapping has been used relatively infre¬ 
quently, but several different test statistics have 
been proposed. Tokeshi (1990) first proposed test¬ 
ing model fit by simulation by comparing empir¬ 
ical rank abundance data with comparable data 
simulated from niche apportionment models, and 
Bersier and Sugihara (1997) subsequently proposed 
a test statistic for such analyses. Volkov et al. (2003) 
and Etienne et al. (2006) use the maximum log- 
likelihood of the model as the test statistic. Con¬ 
nolly et al. (2005; 2009) used deviance, a test sta¬ 
tistic based on the log-likelihood, but normalized 
to facilitate comparability across sites with different 
sample sizes and parameter values. 

Sample size: species or abundance? 

In most species abundance analyses, goodness-of- 
fit tests make an implicit assumption that species 
abundance values are sampled, rather than indi¬ 
viduals. For instance, the null distribution of Pear¬ 
son's X 2 test statistic is based on an assumption that 
the stochastic variability around predicted frequen¬ 
cies of species in an abundance category follows a 
Poisson distribution. This assumed variability cor¬ 
responds to a situation in which a certain num¬ 
ber of species are sampled, each of which has 
an abundance label attached to it. Similarly, in 
some parametric bootstrap analyses, one samples 
an abundance value for each species observed in an 
empirical sample (Diserud and Engen 2000; Con¬ 
nolly et al. 2005; Engen et al. 2008). However, in 
most ecological sampling, a particular area or vol¬ 
ume of habitat is sampled, and this sample contains 
some number of individuals, which in turn have 
species identities. In particular, a replicate sample 
might not yield the same number of species because 
some species that are present in the community do 
not appear in a given sample, and that number is 
subject to stochastic sampling variation. 

It is possible to develop parametric boot¬ 
strap algorithms that simulate this process of 
re-sampling more realistically than the species sam¬ 
pling approach assumed by most goodness-of-fit 
tests. Etienne (2007) provides such an algorithm 
for a neutral model, for which dispersal limitation 
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determines a species' presence and abundance in a 
sample, and Connolly et al. (2009) provide an algo¬ 
rithm for models that assume random sampling 
from an underlying community abundance distri¬ 
bution (e.g. Poisson log-normal, negative binomial, 
etc.)- These algorithms lead to goodness-of-fit statis¬ 
tics whose null distributions are substantially nar¬ 
rower, and therefore more powerful at detecting 
lack of model fit, than analogues based on species 
sampling assumptions (Connolly et al. 2009: also 
see Box 10.1). Such differences suggest that, in gen¬ 
eral, goodness-of-fit tests based on the sampling 
of individuals may be more powerful at detect¬ 
ing departures from species abundance models 
than approaches that implicitly or explicitly assume 
sampling of species. 

Multi-pattern testing 

Most assessments of model fit compare a model's 
predicted values with the same data that were 
used to fit the model. Such assessments tell us 
whether the model adequately describes the species 
abundance data. However, frequently the goal of 
species abundance analysis is to go beyond this and 
determine whether that good fit indicates that the 
model approximates well the biological processes 
that generated the data. One way to do this is 
to test whether the parameters implied by fit to 
species abundance data can also explain other 
characteristics of the assemblage being investigated 
(McGill 2003c). Most examples of this approach have 
involved tests of neutral models. For instance, Adler 
(2004) fitted the neutral model to species abundance 
distributions of local grassland assemblages, used 
those parameter estimates, to determine the species- 
area relationship implied for that assemblage and 
then compared that prediction with the empirical 
species-area relationship. Wootton (2005) estimated 
neutral model parameters from temporal transitions 
in space occupancy in a benthic community and 
then tested the model's ability to predict community 
change when a hypothesized competitive dominant 
was removed. Dornelas et al. (2006) fitted the 
neutral model to species abundance distributions 
of scleractinian reef corals, predicted the frequency 
distributionofcommunitysimilarityimpliedbythose 
estimates, and compared that with the community 
similarity distribution exhibited by the data. 


10.2.5 Model selection 

In species abundance analysis, we often wish to 
know which of several alternative models provides 
the best approximation for the data, in addition 
to (or instead of) assessing their absolute good¬ 
ness of fit. Many studies have compared graphically 
empirical data with the predictions of different 
species abundance models. For instance, many of 
the early analyses that suggest multimodality in 
species abundance distributions were based on 
graphical inspection of species abundance distrib¬ 
utions (Gray & Mirza 1979; Ugland & Gray 1982). 
Other approaches are essentially comparisons of 
goodness-of-fit statistics for different models, such 
as r 2 values calculated from rank abundance dis¬ 
tributions, Pearson's X 2 statistics calculated from 
frequency distributions of species abundance or 
statistics calculated from parametric bootstrapping 
(McGill 2003b; Volkov et al. 2007; Harte et al. 2008; 
Connolly et al. 2009). 

Although approaches like those just described 
can be informative, they have important limitations. 
Alternative models may differ in how much uncer¬ 
tainty is associated with their predicted abundance 
patterns. Models with very flexible forms due to, 
for example, large numbers of parameters, are more 
likely to be able to provide good fits to data, even 
if they would do a poor job of predicting what a 
new sample might look like. In addition, the model 
with the lowest r 2 value, or which appears visually 
to give the best fit, may depend critically on the 
scale on which species abundances are represented. 
For instance, for rank abundances considered on an 
arithmetic scale, apparent model fit will be much 
more dominated by the model's fit to highly abun¬ 
dant species, compared to when rank abundances 
are represented logarithmically. Moreover, differ¬ 
ent models make different assumptions about the 
sources of variability in species abundance data and 
thus differ in terms of how much a given discrep¬ 
ancy between the model and data is consistent with 
a model's assumptions. 

To overcome the subjectivity inherent in such ad 
hoc model selection approaches, biologists in many 
fields are increasingly shifting towards the use of 
model selection statistics that have a stronger foun¬ 
dation in statistical theory. Although this practice 
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has not yet become widespread in the analysis 
of species abundance data, several model selec¬ 
tion statistics have been used, including Akaike's 
information criterion (AIC; Connolly et al. 2005), 
Bayes factors (Etienne & Olff 2005), Bayesian (or 
Schwarz) information criterion (BIC; Dornelas & 
Connolly 2008) and deviance information criterion 
(DIC; Golicher et al. 2006; Mac Nally 2007). Of these 
statistics, AIC and BIC are straightforward to cal¬ 
culate, once one has maximum likelihood estimates 
for parameters, whereas calculation of Bayes fac¬ 
tors and DIC are more complicated, sometimes con¬ 
siderably so, because they incorporate prior beliefs 
about model parameter values. All such model 
selection statistics, however, quantify the trade¬ 
off between the increased uncertainty associated 
with more complex models and the increased bias 
(i.e. systematic discrepancies between model pre¬ 
dictions and data) associated with simpler models. 
Moreover, because they are based on likelihood, 
discrepancy is measured on a natural scale, and 
thus these statistics do not have the arbitrariness of, 
for instance, choosing whether to plot rank abun¬ 
dances arithmetically or logarithmically. 

Akaike's information criterion 

AIC estimates the amount of information that is 
lost when a model is used as an approximation 
for the true distribution from which the data have 
been drawn (Burnham & Anderson 1998). Thus, 
it provides a relative measure of a model's ability 
to predict new data sampled from the same true 
distribution. In many ecological applications, where 
sample sizes are relatively small, a modified form of 
this criterion, called AIC c , is used; 

AlC c = -2 log(C m3X ) + 2k+ 2k{ ^ l) (10.10) 

where £ max is the model's maximum likelihood, 
k is the number of parameters in the model, and 
n is the sample size. The first term measures the 
model's lack of fit to the data in hand, while the 
second two terms account for the fact that models 
with more parameters tend to predict the values 
of new data with greater uncertainty. Notice that 
the third term becomes very small as sample size 
(n) gets large, so that, for moderately large sample 
sizes, AIC depends only on the model's overall fit 


and the number of model parameters. Uncertainty 
about which model is best can also be estimated 
with AIC by calculating Akaike weights, according 
to which the probability that a model is the best 
model is proportional to e~ Alc/2 . 

Like all model selection statistics, AIC is contro¬ 
versial. Proponents of AIC typically emphasize the 
fact that it is an estimate of the information lost 
when the model is used as an approximation for 
the true distribution from which the data come, and 
thus is defensible on objective grounds as an aim 
of model selection. Critics of AIC most frequently 
cite the fact that it is not consistent, which means 
that, when one of the models under consideration 
was actually used to generate the data, the prob¬ 
ability that AIC selects this true model does not 
approach 100% as sample size increases. For exten¬ 
sive discussion of the merits and shortcomings of 
AIC, readers should consult the relevant statistical 
literature directly (see Burnham & Anderson (1998) 
and Taper and Lele (2004) for ecologically oriented 
discussion). 

Finally, it is worth noting that the derivation of 
AIC at several points uses the model as an approxi¬ 
mation for the true distribution from which the data 
come, and therefore AIC is usually recommended 
only for models that are good approximations to the 
truth. Thus, goodness-of-fit testing is an indispens¬ 
able part of model selection using AIC. 


Bayesian approaches 

To understand Bayesian approaches to model selec¬ 
tion, it is important to recognize that all such 
approaches involve the application of Bayes' theo¬ 
rem to calculate probability distributions for model 
parameters: 


p( % ,M)= Pr -^^^ e|M) 


( 10 . 11 ) 


Pr(y|0, M) is equal to the likelihood: the probabil¬ 
ity of observing the data y, if the model, M, with 
a particular set of parameter values, 0, were true. 
p(0\M) is the probability distribution for the para¬ 
meters, prior to the data being collected (the prior). 
This prior may be based on previously available 
independent data, it may reflect subjective belief 
or (most commonly) is chosen so that its effect on 
p(0|y, M), called the posterior distribution, is very 
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small relative to the likelihood. The posterior dis¬ 
tribution is an updated probability distribution for 
the model parameters, in light of the data. Thus, 
if a particular set of parameter values produces an 
extremely good model fit compared to another set 
of parameter values, then the first set will have 
a much larger posterior probability, relative to the 
second set, than in the prior distribution, c is a 
normalizing constant chosen so that the posterior 
distribution, p(6\y, M), is a probability distribution 
(i.e. so that integrating p(B\y, M) over all possible 
combinations of parameter values yields 1.0). One 
can think of the prior and posterior distributions 
as estimating the probability that a particular set of 
parameter values are the 'true' ones. Alternatively, 
they may be interpreted as estimating the probabil¬ 
ity that a particular set of parameter values are the 
ones that yield the best approximation for truth that 
is possible for a particular model (sensu AIC; see, 
for example, Spiegelhalter et al. (2002)). 

The most intuitive Bayesian model selection sta¬ 
tistic is the Bayes factor (also called the posterior 
odds ratio) and it is applicable where only two com¬ 
peting models are under consideration: 

_ Pr(ylM) Pr (Mi) 

Pr(y|M 2 ) Pr (M 2 ) 

[/ 9i Pr(y|fl lt M : ) p (0i|Mx)<i0i] Pr(M,) 

[fe 2 Pr (yl02, M 2 )p(0 2 |M 2 )d0 2 ] Pr(M 2 ) 

( 10 . 12 ) 

Pr(i/|Mi) is the probability of observing the data, 
y, if model Mi were true. To calculate this, we 
must first calculate the probability of observing 
the data, given this model and a particular set of 
parameter values (0i) for that model: Pr (y\8 \, Mi). 
Then, we average this probability over all of the 
possible sets of parameter values for the model, 
weighted according to the prior probabilities for the 
model parameters. That weighted average is the 
integral in the numerator and it, in turn, is multi¬ 
plied by Pr(Mi), which is the prior probability for 
the model. Often this quantity is assumed to be 
equal for all competing models, in which case the 
Bayes factor just estimates the support provided by 
the data for the competing models, analogous to 
a traditional likelihood ratio. In practice, the inte¬ 
grals in equation 10.12 are very hard to calculate, 


so Bayes factors are typically only used where 
a computationally intensive numerical technique, 
Markov Chain Monte Carlo (MCMC), is applied; 
this approach can be used to estimate such inte¬ 
grals. 

The BIC (Schwarz 1978), although derived from 
Bayes factors, is much simpler to calculate: 


BIC = 2 


- l0g(£ ma:< ) + 


fc log(ft) 
2 


(10.13) 


Here, the penalty term for extra parameters, 
k log(n)/2, arises in the derivation of BIC as an 
approximation of the difference between a model's 
maximum log-likelihood and the weighted average 
likelihood that appears in equation 10.12. In con¬ 
trast to the penalty term in AIC, it gets larger as 
sample size increases. In other words, if sample size 
is large, then a model with more parameters will 
need to exhibit a larger improvement in fit to be 
selected as the best model, compared to what is 
required if sample size is small. Unlike AIC, BIC 
is consistent: it selects the true generating model 
with increasing certainty as sample size increases. 
Thus, if one of the models under consideration is 
believed to be true, then BIC should be preferred. 
However, if one instead believes that the true distri¬ 
bution from which the data come is more complex 
than any of the models being compared, then there 
is some disagreement about whether or not it is 
appropriate to use BIC as a model selection statis¬ 
tic (Burnham & Anderson 1998; Spiegelhalter et al. 
2002; Boik 2004). An additional caveat with BIC is 
that its derivation assumes a very large sample size. 
There is no correction for small sample size, as with 
AIC, because if sample size is not large, then, in 
the Bayesian framework, prior beliefs about the val¬ 
ues of model parameters have an impact on model 
selection. 

Finally, DIC is a Bayesian approach based on 
model deviance (Spiegelhalter et al. 2002). Deviance 
estimates lack of fit as the difference between the 
log-likelihood of a model and the log-likelihood of 
a hypothetical model that fits the data perfectly 
Formally, DIC is: 

DIC = D (6) + [d(0) - D (5) j, (10.14) 


where D{8) is an average model deviance, weighted 
according to the posterior distribution of the model 
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parameters, and D (&) is the model deviance, evalu¬ 
ated at the posterior mean parameter values. Here, 
the term in square brackets is analogous to the 
penalty terms in AIC and BIC because the differ¬ 
ence between D(ff) and D (5) will tend to be larger 
for models with more parameters. When the prior 
distribution on model parameters is very flat, or 
there is a very large amount of data, then DIC esti¬ 
mates the same quantity as AIC (Spiegelhalter et al. 
2002). However, when parameters are estimated by 
MCMC methods, DIC can be calculated from the 
posterior distribution of model parameter values, 
therefore DIC may be preferable to AIC when the 
investigator wishes to incorporate prior distribu¬ 
tions for parameter values or where the number of 
parameters or the sample size is not easy to count 
(as in some hierarchical models and data struc¬ 
tures), and thus the value of AIC is ambiguous. 

10.3 Prospectus 

The analysis of species abundance patterns has 
been a staple of macroecology for nearly a cen¬ 
tury. This spans a time period from just after the 
invention of the likelihood concept, to the invention 
of computers, of information theory, of the emer¬ 
gence of 'frequentist' and 'Bayesian' approaches 
as competing schools of thought in statistics, of 
bootstrapping, and of numerical methods of para¬ 
meter estimation that make shortcuts such as 
least-squares unnecessary. Consequently, the tools 
available for fitting and evaluating models have 
expanded enormously since geometric, log-normal 
and log-series distributions were first compared 
with species abundance data. Having reviewed the 
range of tools applied in contemporary analysis, we 
offer some guidelines about how these tools may 
be utilized productively in the analysis of species 
abundance patterns and highlight some important 
areas for further research. 

10.3.1 Sampling theory for species 
abundance models 

Sampling distorts the shape of species abundance 
distributions, often substantially. Therefore, when¬ 
ever possible, compound distributions that explic¬ 
itly include both a model for the 'true' underly¬ 
ing species abundance distribution and a model 


characterizing sampling from that distribution 
should be used. However, for some species abun¬ 
dance models, such as niche-apportionment mod¬ 
els, such compound distributions have not yet been 
formulated, and this is an important area for further 
research. In addition, compound distributions that 
explicitly characterize the effects of local aggrega¬ 
tion at the sampling site, or differences in species 
detectability, on species abundance data have only 
been explored in a few studies. Most ecologists do 
not sample randomly at the scale for which they 
wish to draw inferences; rather, they are more likely 
to intensively sample at multiple small sites spread 
over the broader area of interest. When local aggre¬ 
gation occurs, the abundance distributions at study 
sites will differ from that of the area as a whole. 
Consequently, an accessible and relatively general 
framework for non-random sampling is needed. 

In addition, most sampling theory developed 
to date has focused on counts of individuals and 
thus is appropriate for numerical abundance data. 
Accounting for sampling effects in other currencies 
of abundance (e.g. cover, biomass) is generally lack¬ 
ing, but could proceed in at least two ways. One 
would be to consider sampling effects at the level 
of individuals, alongside the infinitely many ways 
that the continuously varying biomasses of individ¬ 
uals could sum up to a particular observed biomass. 
The other would be to apply a distribution anal¬ 
ogous to equation 10.1, where the first term char¬ 
acterizes stochastic variation in biomass sampled 
given a particular area, volume or total community 
biomass sampled. However, we know of no theory, 
or even empirically documented regularities, that 
might be used to characterize biomass aggregation, 
as exists for individuals (e.g. Harte et al. 1999; He 
& Gaston 2003). This, too, stands as an important 
challenge for future research. 

10.3.2 Parameter estimation 

We favour approaches to parameter estimation 
that utilize maximum likelihood methods wher¬ 
ever possible. In our view, fitting models by 
eye is unnecessarily subjective and, given the 
accessibility of software to fit frequency distri¬ 
butions, no longer justifiable. Moreover, except 
where stochastic simulation is used, binning species 
abundances before fitting models is not necessary 
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and probably reduces the precision of parame¬ 
ter estimates. More generally we are wary of 
least-squares methods for fitting species abundance 
models because they are supported by statisti¬ 
cal theory only under approximate normality and 
homoscedasticity. In general, these conditions will 
not be met for rank abundance data, and will 
only be met for species abundance distributions 
when abundances are categorized, probably quite 
coarsely so for the tail of very highly abundant 
species, and the data or residual variances are 
appropriately transformed. Nevertheless, despite 
its strong theoretical foundations, maximum likeli¬ 
hood is not perfect. In particular, maximum likeli¬ 
hood estimates can be biased, particularly for distri¬ 
butions that, like species abundances, exhibit sub¬ 
stantial skew (Fig. 10.4; also see Diserud and Engen 
(2000) and Connolly et al. (2009)). Consequently, we 
believe that a rigorous, comparative analysis of dif¬ 
ferent approaches to parameter estimation would 
be informative. 

Many species abundance models allow species to 
have abundances that can be arbitrarily large, but 
in real ecological samples, quadrat sizes or transect 
lengths place upper bounds on the total number of 
individuals that can appear in a sample. To account 
for this, some researchers have argued for the use 
of likelihoods that condition on the total number 
of individuals actually sampled, rather than treat¬ 
ing each species' abundance as completely inde¬ 
pendent of all others, thus implicitly allowing for 
unrealistically large or small sample sizes (e.g. Eti¬ 
enne & Olff 2005). Although we have not found 
this statistical independence assumption to increase 
the bias in parameter estimates (Fig. 10.4; also see 
Connolly et al. (2009)), we do not know whether 
this robustness is generally true for species abun¬ 
dance models or what impact it may have on model 
selection statistics. Further work on this question 
may help to identify if, and under what condi¬ 
tions, constraints on total sample size have impor¬ 
tant effects on the results of species abundance 
analysis. 

For species abundance models that are explicitly 
derived from some ecological theory (e.g. stochas¬ 
tic abundance models), the use of external inde¬ 
pendent information about the values of model 
parameters has great potential to strengthen the 
inferences that can be drawn from fits to species 


abundance models (Wootton 2005). This kind of 
approach can be further developed by explicitly 
incorporating uncertainty about model parameter 
values from such independent information. Thus, 
for a given set of parameter values, there would be 
two likelihoods: one for the species abundance data, 
and a second one for the independent data. These 
likelihoods would be multiplied together to obtain 
the overall likelihood for the parameters given both 
data sets. Similarly, in a Bayesian approach, the 
independent information could be used to con¬ 
struct a prior probability distribution for the model 
parameter(s). To our knowledge, such approaches 
have yet to be explored in species abundance 
analysis. 

10.3.3 Goodness-of-fit testing 

Failure to find statistically significant lack of model 
fit indicates only that the investigator has either 
not sampled enough or has used an insufficiently 
powerful goodness-of-fit test to detect it. Claims 
that a model fits data well should be supported 
by evidence that any lack of fit is small in mag¬ 
nitude, in some meaningful sense, regardless of 
statistical significance. Similarly, the use of model 
selection by AIC should be accompanied by evi¬ 
dence that any lack of fit that is present is likely 
to be small (Burnham & Anderson 1998). Con¬ 
versely, finding statistically significant lack of fit is 
a first step to determining what it is about the data 
that the model fails to capture, and whether this 
information implicates particular processes, omit¬ 
ted from the model, as important determinants of 
relative abundance patterns. In this context, we 
believe graphical approaches to assessing model fit 
are most useful because they can indicate where 
discrepancies between models and data are large. 
When compared with the discrepancies that are 
consistent with the model's stochastic elements 
(e.g. as produced in parametric bootstrap simu¬ 
lations), such graphical assessments can be made 
more rigorously (Connolly et al. 2009). Graphical 
assessments are also more likely to bear fruit if a 
variety of representations of model fit are used, 
including species abundance distributions and rank 
abundance distributions, and using a variety of 
transformations of the relevant axes (Pueyo et al. 
2006; Etienne et al. 2007b). 
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Species abundance models make predictions not 
only about the expected number of species with 
a particular abundance, but also about the site-to- 
site variability in relative abundance patterns. This 
information can also be used in goodness-of-fit test¬ 
ing, when multiple species abundance distributions 
are available. This idea was first used for repli¬ 
cate samples from the same community (Bersier 
& Sugihara 1997), but can be extended to sites 
from different locations that have different species 
abundance model parameters (Connolly et al. 
2009). For data that include multiple species abun¬ 
dance samples, predictions about the variability in 
species abundance statistics among samples can 
provide additional information that individual fits 
cannot. 


10.3.4 Model selection 

Cross-validation is a well-established, widespread 
model selection technique that has not been used 
at all in the analysis of species abundances, to 
our knowledge. Because it is such an intuitive 
approach to model selection, it probably deserves 
some attention. However, the best way to approach 
cross-validation in the context of species abundance 
analysis is not immediately obvious. For instance, 
if one simply eliminates a species, chosen at ran¬ 
dom, from a dataset and re-fits a species abundance 
model, then one makes the assumption that the 
species is the unit that is sampled. This assumption 
has substantial effects on the results of goodness- 
of-fit testing (Connolly et al. 2009), so we are wary 
of giving it critical importance in model selection. 
Alternatively, however, it may be possible to sub¬ 
sample individuals from within species abundance 
distributions, or to subsample sites from a collec¬ 
tion of replicate sites. For instance, one could ran¬ 
domly sample half of the individuals from a data 
set, fit a species abundance distribution to those 
individuals and then test the fitted model against 
the abundance distribution for the other half. Simi¬ 
larly, where multiple sites are distributed randomly 
within a larger area, and the species abundance dis¬ 
tribution is expected to have the same parameters in 
each site, then a species abundance model fitted to 
some of the sites could be tested against the abun¬ 
dance distributions from the other sites. 


In general, there is considerable controversy 
about what approach to model selection is the best 
one, in ecological or other applications. Therefore, 
rather than recommending the use of a particu¬ 
lar statistic or method, we encourage analysts to 
understand and take appropriate account of the 
assumptions that underlie the model selection sta¬ 
tistics that they do use. Many applications of AIC, 
for instance, are undertaken without any attempt to 
evaluate goodness of fit, despite the fact that one 
must assume, in order to derive AIC, that the model 
in question is such a good approximation that it 
can be substituted for the truth. Similarly, when 
Bayesian model selection statistics are used, and the 
prior distribution is chosen for mathematical conve¬ 
nience or on the basis of subjective belief, an assess¬ 
ment of the robustness of a study's conclusions to 
those priors is important. Finally, graphical analysis 
of model fit can be an informative complement to 
formal model selection, just as it is to goodness-of- 
fit testing. 

10.3.5 Conclusions 

Parameter estimation, goodness-of-fit testing and 
model selection are tools for obtaining as good 
a fit of a model to data as possible—either the 
data in hand, or a new sample of the same kind 
of data from the same sampling universe. For 
most ecologists, however, the goal of species 
abundance analysis is not to describe variation in 
relative abundances, but rather to use information 
about model fit to draw inferences about the 
processes that drive community structure or 
dynamics. Rigorous statistical techniques help 
ecologists quantify how well ecological models 
characterize patterns in nature, but they are an 
aide to, and not a substitute for, thinking about the 
processes that give rise to those natural patterns. 
Philosophers, historians and sociologists of science 
have increasingly recognized that the evaluation 
of models in science often depends on much more 
than goodness of fit or predictive accuracy. In 
practice, scientists often evaluate models based in 
part on how consistent they are with existing, well- 
supported theory, or on their explanatory power— 
their ability to provide unified explanations for a 
large number of patterns that previously required 
separate explanations (Kosso 1992). 



140 BIOLOGICAL DIVERSITY 


Consider the evaluation of neutral models as 
an example. Neutral theory offers considerable 
explanatory power, purporting to explain not only 
patterns of species abundance, but species-area 
relationships and even patterns of speciation and 
extinction in the fossil record (Hubbell 2001). On 
the other hand, there is considerable empirical evi¬ 
dence for niche differences and life history tradeoffs 
among organisms. There is also an extensive body 
of ecological theory, supported by experiments, that 
links such patterns to species coexistence. Con¬ 
sequently, neutral models, by explaining species 
coexistence as transient, driven by demographic 
stochasticity and with species differences playing a 
negligible role, appears to contradict a large body 
of ecological theory and data about species coexis¬ 
tence (Abrams 2001; Mazancourt 2001). This may 
explain why some of the more influential tests of 
neutral theory have examined how well-known 
ecological realities omitted from neutral theory 
drive the very patterns that neutral models seek to 
explain (Fargione et al. 2003) or assessed the extent 
to which neutral models really can explain mul¬ 
tiple patterns simultaneously (Adler 2004). Those 
two kinds of tests address these important (albeit 
subjective) criteria for evaluating models: poten¬ 
tial explanatory power and consistency with estab¬ 
lished and well-supported theory. 

Bayesian approaches offer a way to combine 
these different kinds of considerations by means 
of prior probability distributions on parameters 
or prior probabilities assigned to models based 
(for instance) on their mechanistic plausibility or 
their explanatory power. However, there will prob¬ 
ably never be ways to objectively determine how 
much to weight to give these different consid¬ 
erations, alongside model fit, when evaluating 
models. Thus, model evaluation in the broadest 
sense—the scientific community's judgment about 
a model's potential to be a productive frame¬ 
work for future research—will probably always 
involve subjective elements. Indeed, for precisely 
this reason, we believe that the use of rigorous 
and powerful statistical tools for assessing model 
fit is crucial: by providing objective information 
important to model evaluation, it can help to make 
the dialogue between theory and data as fruitful as 
possible. 


10.4 Key points 

1. Fitting models to species abundance distrib¬ 
utions entails considerations about four main 
issues: obtaining model predictions, estimat¬ 
ing parameters, testing the goodness of fit and 
selecting among competing models. 

2. Because sampling affects the shape of species 
abundance distributions, often substantially, 
species abundance models should incorporate a 
sampling theory. 

3. We favour maximum likelihood methods 
for parameter estimation because commonly 
used alternatives are problematic on statistical 
grounds. Maximum likelihood will probably 
perform best when applied to abundances that 
have not been grouped into abundance classes, 
unless model predictions must be obtained 
by stochastic simulation. However, a rigorous 
comparative analysis of different approaches 
to parameter estimation has not yet been 
undertaken. 

4. Rigorously quantifying goodness of fit is an 
important part of species abundance analysis, 
even if model selection is used instead of clas¬ 
sical null hypothesis testing. Parametric boot¬ 
strapping is a robust and versatile alternative 
to conventional goodness-of-fit testing, particu¬ 
larly useful where conventional tests have low 
statistical power. Graphical analyses comple¬ 
ment formal goodness-of-fit testing by helping to 
identify the features of the data most responsible 
for lack of fit. 

5. Model selection statistics compare models 
according to different criteria, reflecting 
philosophical differences about the nature and 
purpose of statistical analysis, and they should 
be used and interpreted with these differences in 
mind. However, all involve a trade-off between 
how well a model fits the data in hand, and 
how flexible the model is (i.e. how readily it 
might fit noise in the data as if it were part of 
the pattern). Consequently, they offer a stronger 
basis for comparing the extent to which models 
capture real patterns in particular empirical 
data sets, relative to traditional approaches like 
graphical inspection and comparative analysis 
of goodness-of-fit statistics. 



CHAPTER 11 


Species occurrence and occupancy 

Kevin J. Gaston and Fangliang He 


11.1 Introduction 

Arguably, one of the most fundamental units of 
biodiversity is the presence or absence of a species 
in a given site (a resource or habitat patch, an 
island, a mapping emit, etc.). This is evidenced by 
the fact that simple species x sites (r x c) matrices, 
in which the presence/absence of different species 
(in rows r) is given for a set of different sites (in 
columns c), lie at the heart of probably the vast 
majority of biodiversity studies (Simberloff & Con¬ 
nor 1979; Bell 2003; Arita et al. 2008; Gaston et al. 
2008a). Indeed, some of the most basic patterns 
which emerge from presence/absence r x c matri¬ 
ces have long intrigued ecologists, such as species- 
area relationships, nestedness, and gradients in (1 
diversity. 

Summing the presences along a row of an r x c 
matrix gives the level of occupancy of a given 
species, often most usefully expressed in terms of 
the proportion (p) of the available sites that are 
actually occupied (or, equivalently, the probability 
that the species occurs in any one site). This may 
vary greatly from one species to another, and the 
question of why some species are widely distrib¬ 
uted and others narrowly has long been a staple 
of ecological discourse (e.g. Darwin 1859; Harper 
1981; Rabinowitz 1981; Gaston 1994; Kunin & Gas¬ 
ton 1997). The actual area over which a species is 
distributed (i.e. the summed areas of the sites at 
which it occurs) has been termed its 'area of occu¬ 
pancy' (Gaston 1991; Gaston & Fuller 2009), and is 
important for some, particularly applied, consider¬ 
ations, such as evaluating the risk of extinction that 
species face in the short term as a consequence of 
anthropogenic pressures (Mace et al. 2008). 

The level of occupancy attained by a species is 
strongly influenced by the spatial resolution of the 


occurrence data (i.e. by the size of a 'site'). This is 
simply because the level of occupancy itself fails 
to capture other significant features of spatial dis¬ 
tribution, particularly the way in which at a fine 
resolution occupied sites are spatially dispersed. 
On average, of two species with the same occu¬ 
pancy at a fine scale, that with the more dispersed 
pattern of occurrences will have the greater occu¬ 
pancy at a coarser resolution (one simple widely 
used measure of this dispersion is the area con¬ 
tained within the geographically outermost occur¬ 
rences of a species, its 'extent of occurrence'; Gas¬ 
ton 1991; Gaston & Fuller 2009). Although it is 
only relatively recently that the analytical form of 
the relationship between spatial resolution and the 
observed level of occupancy of a species (Fig. 11.1; 
the occupancy-area relationship) has come to the 
fore, the potential significance of spatial resolu¬ 
tion on observed patterns of occupancy has been 
recognized for much longer (e.g. Erickson 1945; 
Rapoport 1982). 

The occupancy of a species is only a crude carica¬ 
ture of the distribution of its individuals. However, 
the level of occupancy attained across a set of sites 
is inevitably a function of the number of individuals 
summed across those sites. Only one site can be 
occupied when there is a single individual (assum¬ 
ing that sites are defined to be at least larger than 
an individual), and there is almost invariably some 
upper limit to the number of individuals that can 
physically co-occur in a single site, even ignoring 
all of the factors that can act to disperse individ¬ 
uals more widely (e.g. air and water currents, life 
history, behaviour). Much attention has been paid 
to the actual form taken by occupancy-abundance 
relationships (Fig. 11.1), being motivated initially 
mostly by the wish to be able to estimate the abun¬ 
dance of a species from its much more readily (and 
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Figure 11.1 (a) Occupancy-abundance curves for 817 tree species from the 50 ha (1000 x 500 m) Pasoh forest plot, Malaysia. The lowermost curve is 
constructed for the occupancy calculated by cell size a = 10 x 10 m, the middle curve is for a = 25 x 25 m, and the uppermost curve for 
a = 100 x 100 m. (b) Occupancy-area curves for 20 species randomly selected from the 817 species, (c) The distribution of the abundance of the 817 
species, serving as a reference for the occupancy distributions of three cell sizes (d, e, f). 


economically) estimated level of occupancy (e.g. 
Nachman 1981, 1984; Wilson & Room 1983; Kuno 
1986; Ward et al. 1986). Although such considera¬ 
tions have in the main concerned agricultural pests 
(and the need for indicators of when to take con¬ 
trol measures against them), more recently they 
have extended particularly to species of conser¬ 
vation concern (and the need to understand their 


population sizes; e.g. Tosh et al. 2004; Sara 2008; 
Figueiredo & Grelle 2009). 

By contrast, the frequency distribution of 
the occupancy levels of the species in an r x c 
matrix (i.e. the distribution of the row totals), 
the species occupancy distribution (Fig. 11.1), has 
been subject to investigation by generations of 
ecologists (e.g. Willis 1922; Gleason 1929; McIntosh 
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1962; Williams 1964; Anderson 1977; Hengeveld 
1990; Gaston & Blackburn 2000). The form of this 
distribution serves to move consideration beyond 
the question of why some species are more widely 
distributed than others to that of why typically 
a disproportionately high number of species 
are narrowly distributed, and the circumstances 
in which this is not the case (e.g. Hesse et al. 
1937; Hanski 1982; Gaston & He 2002; McGeoch 
& Gaston 2002; Storch & Sizling 2002; Gaston 
2003). Answers to this latter issue have become 
increasingly significant with recognition that at 
broad spatial scales the form of species occupancy 
distributions may be central to an understanding of 
many other macroecological patterns and processes 
(e.g. Gaston 2003; Sizling et al. 2009). 

In this chapter we provide a broad review of the 
present understanding of these three key patterns 
in the study of biodiversity, namely occupancy-area 
relationships, occupancy-abundance relationships 
and species occupancy distributions. In so doing 
we touch on a wide variety of issues pertaining 
to the ecology of the occurrence and occupancy 
of species. Following the bulk of the literature on 
these patterns, and for reasons of clarity of presen¬ 
tation, throughout we largely ignore the influences 
of variation in the detectability of individuals of 
given species or between different species, although 
clearly these can have marked influences on esti¬ 
mated levels of occupancy and abundance, and in 
empirical work will often need to be given careful 
consideration (see Chapter 3). 

11.2 State of the field 

11.2.1 Occupancy-area relationships 

It has long been known that the spatial distributions 
of the individuals of a species are seldom, if ever, 
strictly random or uniform. Rather, they are aggre¬ 
gated on multiple spatial scales, with individuals 
occurring in clusters that are themselves aggregated 
into larger clusters, which in turn are aggregated 
into larger clusters still, and so forth. Many meth¬ 
ods are available for describing such intraspecific 
occupancy patterns (e.g. Taylor 1984; Krebs 1999). 
One key approach is to characterize them in terms 
of simple models linking occupancy and the area. 


a, of a site (here termed occupancy-area relation¬ 
ships, but elsewhere referred to as area-area, p- area 
or scale-area relationships; Kunin 1998; He & Gas¬ 
ton 2000a; Sizling & Storch 2004). Here sites are 
assumed to have some characteristic size or scale, 
usually because species occurrences are mapped 
onto an equal-area grid, from which cells at a 
fine resolution can easily be perfectly aggregated 
to form cells (and thus an occurrence map) at a 
coarser resolution. This is mathematically conve¬ 
nient, although clearly at odds with the marked 
heterogeneity of the natural environment and the 
complexities of the sizes of the set of habitat patches 
in which most species reside. 

A large set of occupancy-area relationships take 
the general form 

where a is the area of the mapping unit, c and z are 
positive parameters, and k is a real number para¬ 
meter in the domain (-oo,-ca z ), or (0, +oo) (He & 
Condit 2007). For a given area and k greater than 0, 
as k increases occupancy also increases (Fig. 11.2). 
For a given area and k less than 0, occupancy 
declines as k assumes progressively greater nega¬ 
tive values. This embraces many of the more famil¬ 
iar occupancy-area relationships (see section 11.2.2 
for more details) as special cases (He et al. 2002), 
such that 

(i) k -»■ ±oo and c = z = 1 gives the Poisson model 

p = 1 - e~‘ (11.2) 

(ii) c = z = 1 gives the negative binomial model 

/ a\~ k 

p = 1 - (l + -) (11.3) 

(iii) k = — 1 gives the power-law model 

p = ca z (11.4) 

(iv) k = 1 gives the logistic model 


and 

(v) k -»■ ±oo gives the Nachman model 

p = 1 — e~ c “ z (11.6) 
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Figure 11 .2 Illustration of model (11.1) as a function of aggregation 
parameter k, where 3 = 0.6. Black curves: c = z = 1; grey curves: c = 1, 
z = 0.5; dashed curves: c = 0.5, z = 0.5. The dashed vertical and 
horizontal lines are the limits of the model. The occupancy p for the 
horizontal line isp = 1 - e~ 0 - 6 = 0.4512, which is model ( 11 . 2 ). 

In all cases, occupancy tends to saturate as the reso¬ 
lution at which this is being documented becomes 
coarser (i.e. as a increases). The linearized forms 
of these models, which are therefore often of 
more value for analytical purposes, are respec¬ 
tively (i) Poisson model: log(l - p) = -a, (ii) neg¬ 
ative binomial model: log(l — p) = —k log(l +a/k), 

(iii) power-law model: log(p) = log(c) + zlog(a), 

(iv) logistic model: log(j^) = log(c) + zlog(a) and 

(v) Nachman model: log[— log(l — p)] = log(c) + 
zlog(fl). Note that the last three of these models 
share a common form in being functions of log(fl). 
For a given set of species' occupancy data, the 
models can be fitted by using either a simple lin¬ 
ear regression (with the exception of the negative 
binomial model, which requires a non-linear algo¬ 
rithm; He & Condit 2007) or a maximum likelihood 
method (He et al. 2002). 

There has been limited comparative testing of 
these different models for intraspecific occupancy- 
area relationships, but it seems unlikely that any 
one of them will remain a consistently better 
descriptor of the patterns for different species 
(He & Condit 2007). However, whilst there is no 
evidence for its inherent superiority, versions of the 
power-law model (p = ca z ) have been particularly 
widely used in examining occupancy-area 
relationships. This is because the nested clustering 
of individuals of a species has led to the view 
that occupancy patterns can best be understood 
in terms of a fractal paradigm (e.g. Lennon et al. 
2007; Sizling & Storch 2007). Whether one strictly 



agrees with this or not, at the very least this may 
be a useful way to think about the consequences 
of a habitat hierarchy and/or a hierarchy of the 
ecological requirements of species in which a 
region is divided into broad environments defined 
by climate, within which are areas defined by 
geomorphology and substrates, divided in turn 
into patches of macrohabitat each comprised of 
different subpatches of microhabitat, and so forth 
(Storch et al. 2008; see also Kolasa et al. 1989). 

The fractal model is more typically described as 

P = Pol°~ z (11.7) 

where po is a constant, l = yja (i.e. I is the length 
of one side of a square mapping unit), and D is 
the box-counting fractal dimension (Lennon et al. 
2007). In a stochastic fractal, the probability that 
a particular sub-area of a larger occupied area is 
occupied is a constant (i.e. it is scale independent), 
and D is therefore a constant (p 0 is also expected 
to be unity, although an empirical fit often allows it 
to vary; Lennon et al. 2007). Empirical studies have 
shown variable fits to a stochastic fractal model (e.g. 
Kunin 1998; Ulrich & Buszko 2003; Hartley et al. 
2004; Sizling & Storch 2004; Lennon et al. 2007). 
Moreover, whilst a constant D may be a mathemat¬ 
ically useful property, there is no necessary reason 
why it need in reality be constant (Sizling & Storch 
2007) and D is often found to decrease with the 
increase of spatial scale (Z) (He & Condit 2007). A 
variety of approaches have thus been taken to relax 
this constraint (e.g. Hui & McGeoch 2007b; Lennon 
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et al. 2007; Storch et al. 2008). The approach of gen¬ 
eralized fractals seems to be particularly promis¬ 
ing (Sizling & Storch 2007; Storch et al. 2008). This 
retains the essential properties of fractals but does 
not necessitate strict self-similarity by randomly 
replacing an original set of areas by sets of smaller 
patches nested within them following a particu¬ 
lar repeated process. Other related approaches to 
modelling occupancy-area relationships (and often 
related biodiversity patterns) have also been taken 
based on simple rules of the occupancy of nested 
grid cells (e.g. Harte et al. 2005; Conlisk et al. 
2007). 

11.2.2 Occupancy-abundance relationships 

The general model for occupancy-area relation¬ 
ships described above (equation 11.1) is a simple 
modification of, and was originally derived from, 
a general model for occupancy-abundance relation¬ 
ships, making the assumption that mean density (/u) 
in the latter can be replaced by area (a) in the former 
(He & Condit 2007). Thus the general model for 
relationships between occupancy and abundance 
takes the form 

P = 1 -( 1 + Cj f) k (11 ' 8) 

or 

= * [(1 - p)~ 1/k - l] V2 (11.9) 

(He et al. 2002). The associated variance-mean rela¬ 
tionship, of considerable ecological interest itself 
(Gaston 2003), takes the form 



where a 2 is the (spatial) variance in abundance 
across areas (Holt et al. 2002a). 

The special cases of this general model, derived 
under the same conditions as before (section 11.2.1), 
are generally more familiar in the context of 
occupancy-abundance relationships, many having 
been standard ecological fare for a very long time. 
The Poisson model (p = 1 — e ,J ) reflects the case in 
which individuals are distributed in space at ran¬ 
dom, and is widely adopted as a null model for 
occupancy-abundance relationships (Wright 1991). 


However, as we have seen, the circumstances under 
which this actually occurs are sufficiently scarce 
that the utility of such a null is very doubtful, and 
it is usually rapidly dismissed as a fit to real eco¬ 
logical data except when species are very scarce 
(e.g. Pielou 1977; Taylor et al. 1978; Greig-Smith 
1983; Gaston 1994; Brown et al. 1995; Hinsley et al. 
1996; Venier & Fahrig 1998). The usual explanation 
for this outcome is that there are numerous abiotic 
and biotic reasons why species are unlikely to be 
randomly distributed. However, it should not be 
forgotten that because such a distribution is only 
one among a large continuum of possibilities it may 
itself be exceedingly unlikely to occur on a simple 
probabilistic basis (Taylor 1961). 

The negative binomial model (p = 1 — (l + f) ) 
is the most frequently used to describe aggregated 
spatial patterns of the occurrence of a species (He & 
Gaston 2000a). However, although a large number 
of possible causal derivations have been identi¬ 
fied, mostly based on the compounding of random 
processes (e.g. Boswell & Patil 1970; Taylor 1984), 
its suitability as a general descriptor of the spatial 
distributions of species has been much debated. 
Even where the negative binomial provides a rea¬ 
sonable fit to observed distributions of abundances, 
appropriate values of k are dependent on the mean 
density (/*; Finch et al. 1975; Taylor et al. 1978, 
1979; Nachman 1981; Taylor 1984; Perry & Tay¬ 
lor 1985, 1986; Shorrocks & Rosewell 1986; Has¬ 
sell et al. 1987; Rosewell et al. 1990; Feng et al. 
1993). The negative binomial model has a variance- 
mean relationship of the form a 2 = fx + (^j- ^ (Rout- 
ledge & Swartz 1991; Perry & Woiwod 1992; 
Gaston & McArdle 1994). 

The power-law model (p = Cfx z ) (Leitner & Rosen- 
zweig 1997) follows a (positive) binomial distrib¬ 
ution describing a regular distribution of species. 
It has a variance-mean relationship of the form 
a 2 = Cfi 2 (1 — Cj/), with C)i z < 1. The power model 
is suitable for species of less aggregated or regular 
distribution, with the level of occupancy for a given 
abundance being greater than that for the Nachman 
model (below). 

The logistic model (p = Wr-r) (Hanski & Gyl- 
lenberg 1997) describes the situation in which the 
occurrence of individuals of a species follows a geo¬ 
metric distribution (He & Gaston 2000a). Departure 
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from this distribution is captured by c and 2 /= 1. It 
has a variance-mean relationship of the form u 2 = 
Cju 2 (1 + C/j. z ), which is larger than the variance in 
Taylor's power model (see below), suggesting that 
the logistic model is appropriate to describe pat¬ 
terns for species having stronger aggregation than 
that under the Nachman model. 

The Nachman model (p = 1 — e~ Cf,z ) (Nachman 
1981,1984) is suggested as an empirical generaliza¬ 
tion of the Poisson model with c and z /= 1 repre¬ 
senting a departure from the Poisson distribution 
(He & Gaston 2000b). The Nachman model predicts 
a variance-mean relationship of the form cr 2 = C/u 2 . 
This is Taylor's power function (Taylor 1961). It 
is the model that has been used most widely to 
describe intraspecific mean-variance relationships 
in empirical abundance data, although the general¬ 
ity of its suitability has repeatedly been challenged 
(Taylor 1984; Sawyer 1989; Routledge & Swartz 
1991). 

Similar to the occupancy-area models intro¬ 
duced in section 11.2.1, the fitting of occupancy- 
abundance models to empirical data can be done 
using linear (or non-linear) regression methods or 
maximum likelihood methods (He et al. 2002; He 
& Condit 2007). Occupancy-abundance relation¬ 
ships can be documented on the basis of intraspe¬ 
cific or interspecific data (Gaston et al. 2000). Note 
that commonly in the latter case, although less so 
in the former, these are documented with abun¬ 
dance averaged only over occupied sites (/*') rather 
than over all sites (/x). From an ecological perspec¬ 
tive this has been argued to remove the likelihood 
of an 'artefactual' positive occupancy-abundance 
relationship arising (occurrence in a smaller num¬ 
ber of sites will tend to lead to lower ;x simply 
because the number of individuals is divided by 
the total number of possible sites). However, in 
terms of exploring the relationships between dif¬ 
ferent patterns in occupancy and linking these to 
models in the broader statistical literature, it makes 
more sense to express occupancy-abundance rela¬ 
tionships in terms of t±, as we have done here. 

In the case of intraspecific data, an occupancy- 
abundance relationship is the equivalent of plotting 
occupancy at a coarser spatial resolution against 
occupancy at the resolution at which spatial units 
can only be occupied by a single individual of 


the species. This can be done with the data points 
derived either across the same set of sites at dif¬ 
ferent points in time (e.g. Gaston et al. 1998a, 
1999b; He & Gaston 2003; Freckleton et al. 2005; 
Borregaard & Rahbek 2006), or across different 
sets of sites in different parts of the species' geo¬ 
graphic distribution (e.g. Venier & Fahrig 1998; He 
& Gaston 2003). In both cases, positive relation¬ 
ships are most frequently documented, although 
not invariably so, with a likely expected weaken¬ 
ing for more widespread species (for which marked 
changes in abundance may be required, both sta¬ 
tistically and ecologically, to result in detectable 
changes in occupancy). The fit of different forms of 
occupancy-abundance relationships has, however, 
seldom been tested for intraspecific data (but see He 
& Gaston 2003; Sileshi et al. 2006). 

If intraspecific occupancy-area relationships 
were sufficiently similar between species, then pos¬ 
itive interspecific occupancy-abundance relation¬ 
ships would tend inevitably to follow, as these 
would equate to plotting the occupancy achieved 
by a single species at a given coarser spatial 
resolution according to a particular scaling rule 
as its abundance varied. In fact, there is doubtless 
much latitude in how similar intraspecific 
occupancy-area relationships need to be in order 
for positive interspecific occupancy-abundance 
relationships to result. It is thus unsurprising that 
such positive interspecific relationships have been 
documented across a wide range of spatial scales, 
environments (terrestrial, freshwater, and marine), 
and higher taxa, almost invariably using data 
points derived for different species across the same 
set of sites over the same time period (Fig. 11.3; e.g. 
Gaston et al. 1998b; He & Gaston 2003; Selmi & 
Boulinier 2004; Heino 2008; Leger & Forister 2009). 
Indeed, a meta-analysis has shown that typically 
one-third of the variance in abundance between 
species is explained by differences in occupancy 
(Blackburn et al. 2006). The fit of different forms 
of occupancy-abundance relationships has, again, 
seldom been tested, although the general model 
(equation 11.8) tends often to fit very well (Holt 
et al. 2002a; Gaston et al. 2006). 

The formation of occupancy-abundance relation¬ 
ships has most often been explained from the spa¬ 
tial perspective—any factors that would affect the 
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Figure 11.3 Interspecific occupancy-abundance relationships and 
model fits for four data sets, (a) 817 tree species from the 50 ha 
(1000 x 500 m) Pasoh forest plot, Malaysia, at cell size = 10 x 10 m 
(the same data as Fig. Id); (b) 301 tree species from the 50 ha plot, Barro 
Colorado Island, Panama, at cell size = 50 x 50 m; (c) 109 bird species 
from Hertfordshire, England, at cell size = 2 x 2 km (Smith et al. 1993); 
(d) 131 bird species from south-east Scotland, at cell size = 2 x 2 km 
(Murray et al. 1998). The four models are Poisson (solid line), Nachman 
(dashed), logistic (dotted), and negative binomial (longdash). As judged by 
AIC, the logistic model is the best for (a) and (d), while the negative 
binomial is the best for (b) and (c). 

spatial distribution, aggregation, or randomness 
in the distribution of the individuals of a species 
would contribute to occupancy-abundance rela¬ 
tionships (Holt et al. 2002a). These might include 
niche breadth, niche position, and habitat selec¬ 
tion (for reviews see Gaston et al. 1997, 2000; Gas¬ 
ton 2003). A number of empirical studies have 
sought to test the importance of such mechanis¬ 
tic drivers (e.g. Hughes 2000; Cowley et al. 2001; 
Simkova et al. 2002; Heino 2005; Kolb et al. 2006), 
although the great variation in how this has been 
done limits the ease with which broad conclu¬ 
sions can be drawn. A plausible but different argu¬ 
ment is that occupancy-abundance relationships 
can arise from the colonization-extinction equilib¬ 
rium of metapopulation dynamics. According to 
metapopulation dynamics, occupancy (or the inci¬ 
dence function) of a species can be expressed as a 
function of colonization and extinction such that 

p = cT? (mi) 


where C and E are colonization and extinction 
rates, respectively (Hanski 1994, 1997). Whilst C 
and E may be various functions of mean abun¬ 
dance (/a), the most general forms are C = a /A and 
E = e ^~ d , i.e. colonization increases and extinction 
decreases with the number of individuals (Han¬ 
ski 1994). Substituting these two rates into (11.11) 
leads to 

P = 1 - (l + (H-12) 

where c = a/e and z = b + d. Equation 11.8 is a gen¬ 
eralization of equation 11.12, pointing to the con¬ 
nection between metapopulation dynamics and 
occupancy patterns. 

A more interesting approach is to equate equa¬ 
tions 11.8 and 11.11, leading to 

I - ('- t )‘ ~ 1 |1L13) 

This model postulates a relationship between 
spatial aggregation as represented by k and 
colonization-extinction processes. Consider a 
metapopulation with fixed /la. Under this condition, 
when k approaches 0 (i.e. the population is 
extremely aggregated), equation 11.13 equals 0, 
i.e. C/E =0. This means that spatial aggregation 
results if colonization is extremely low or extinction 
is extremely high. This result is consistent with 
observations that species aggregation can be closely 
associated with poor colonization ability (Tilman 
et al. 1997a). 

11.2.3 Species occupancy distributions 

In isolation, understanding of the form of 
occupancy-area relationships provides limited 
insight into the relative frequency with which, 
at a given spatial resolution, the species in an 
assemblage will attain different levels of occupancy. 
A diversity of assumptions have thus been made 
about the variation amongst species in the scaling 
of occupancy with area to enable prediction of 
the shape of species occupancy distributions (e.g. 
Hui & McGeoch 2007a,b; Sizling & Storch 2007; 
Storch et al. 2008). In the main, although not 
exclusively, attention here has focused on very 
fine spatial resolutions (i.e. a is small), at which 
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point the species occupancy distribution equates 
to the species abundance distribution (assuming 
that only a single individual can fit within any 
given spatial unit). This is because of the great 
interest in being able to extrapolate from relatively 
coarse species occupancy data, which are for many 
species relatively easily obtained (although not 
necessarily for rare ones), to abundance data, 
which are much more difficult and expensive 
to acquire (e.g. Kunin 1998; He & Gaston 2000a; 
Tosh et al. 2004; Sara 2008; Conlisk et al. 2009; 
Figueiredo & Grelle 2009). Although species 
abundance distributions are discussed at greater 
length elsewhere in this volume (Chapter 10), it is 
notable that the predictions from occupancy-area 
relationships can provide particularly good fits 
to empirical abundance data (e.g. Storch et al. 
2008). 

Species occupancy distributions will change with 
spatial resolution, such that those at coarser res¬ 
olutions may often be of rather different shapes 
from species abundance distributions. This can be 
simply illustrated by assuming that (i) the species 
abundance distribution follows a log-normal distri¬ 
bution 

1 1 / lnM-y \ 2 

f(x)= V^ x e (11 ' 14) 

where x is abundance and /i and a are the mean 
and variance of log(x), and (ii) the occupancy-area 
relationship takes a Poisson form 

l.=\- e - axlA (11.15) 

A 

where a and A are the spatial resolution and extent, 
respectively, and y is the occupied area. Our interest 
is to find out the distribution of y and how the res¬ 
olution would affect the distribution. By the tech¬ 
nique of random variable transformation from x to 
y, it is easy to show that y has distribution 

<1U6) 

Note here that /(.) is the log-normal distribution 
of equation 11.14. From Fig. 11.4, it is apparent that 
when a is small, the species occupancy distribution 
(equation 11.16) is virtually the same as the species 
abundance distribution. However, as a increases 
the species occupancy distribution shifts toward the 


right. When a increases further, the distribution of 
more and more species saturate the entire study 
area, making the occupancy distribution upturn at 
the far right (right panels of Fig. 11.4). This upturn 
merely reflects the effect of spatial resolution, with 
little biology. Note that if other occupancy-area 
models are used in place of the Poisson model the 
species occupancy distribution (equation 11.16) can 
have multiple modes. 

Typically, the number of species with low occu¬ 
pancy will increase as spatial resolution increases, 
and also will increase with the spatial extent of 
a study and sample number; the opposite will be 
true for species with high occupancy (McGeoch 
& Gaston 2002). However, because the details 
of occupancy-area relationships differ between 
species, the picture may become rather more 
complex than this. Indeed, although the potential 
for marked variation in species occupancy distri¬ 
butions with spatial resolution has been shown to 
arise both from relatively simple occupancy-area 
models (Hui & McGeoch 2007a, 2007b) and empir¬ 
ically (Collins & Glenn 1997; Guo et al. 2000; van 
Rensburg et al. 2000; McGeoch & Gaston 2002), 
it remains an open question to what extent the 
wide range of observed forms of species occupancy 
relationships can be explained on this basis alone. 
McGeoch & Gaston (2002) observe that across a 
diversity of spatial resolutions and extents docu¬ 
mented forms include unimodal distributions with 
peaks at low, intermediate, or high levels of occu¬ 
pancy; bimodal distributions with dominant peaks 
at low, high, or both low and high levels of occu¬ 
pancy; and uniform or apparently random dis¬ 
tributions. At the scale of the entire geographic 
ranges of species, species occupancy distributions 
tend invariably to be strongly right skewed (Gaston 
1996a, 2003). 

The importance of spatial resolution for the 
shape of species occupancy distributions may be 
reflected in a number of other mechanisms that 
have been postulated as explanations, including 
those based on habitat and environmental hetero¬ 
geneity (see McGeoch & Gaston 2002). The prin¬ 
cipal alternative set of mechanisms are rooted in 
colonization /extinction dynamics, which may yield 
a wide variety of species occupancy distributions, 
dependent on the assumptions that are made as to 
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Figure 11.4 The effect of spatial scale on the relationship between species abundance distributions and species occupancy distributions. All the 
distributions are plotted on a log scale. The left panels are the log-normal species abundance distributions (model 11.14). The first set of log-normal 
parameters = 3. 12, a 2 = 3.20) were calculated from the tree species abundances of the 50-ha plot on Barro Colorado Island, Panama (for those trees 
of size > 10 cm). In the second row, the variance is arbitrarily set to 14.36 to illustrate how variance affects occupancy distribution. The right panels are 
the respective species occupancy distributions (model 11.16) corresponding to the log-normal distribution. In both cases, the total study area is set to be 
50 ha. The density curves from left to right correspond to the cell sizes a = 1, 50, 250, 900, and 1500 m 2 . 


how rates of site colonization and extinction change 
with overall levels of occupancy and local abun¬ 
dance (Hanski 1982; Gotelli 1991; Warren & Gaston 
1997). 

11.3 Prospectus 

Understanding of the form, determinants, and 
implications of occupancy-area relationships, 
occupancy-abundance relationships, and species 
occupancy distributions has increased markedly in 


recent years and doubtless will continue to do so. In 
this regard, we make four main sets of suggestions: 

(i) Predicting abundance from occupancy: Sub¬ 
stantial success has been achieved in predict¬ 
ing the abundance of a species from its spa¬ 
tial occupancy for study areas of small extent 
(e.g. of a few hectares or a small number 
of sites; He & Gaston 2000a; Sileshi et al. 
2006; Conlisk et al. 2009). However, this is of 
only limited practical benefit, given that it is 
relatively easy to estimate abundance more 
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directly at such scales. Of far more importance 
would be an ability to predict abundance from 
occupancy at scales of hundreds or thousands 
of square kilometres, where direct estimation 
is extremely resource demanding and some¬ 
times virtually impossible. Unfortunately, the 
methods presently available are of limited 
value for this purpose, commonly leading to 
under-estimation of numbers of individuals at 
such scales even by several orders of magni¬ 
tude. A major challenge is how best to develop 
methods that are more appropriate. We doubt 
that occupancy data alone will be sufficient 
for this purpose, but suggest that some com¬ 
bination of broad-scale occupancy data and 
fine-scale abundance data from one or a few 
small areas are probably needed. Together, 
these will provide a hierarchical understand¬ 
ing of species distribution, which may provide 
a much more secure basis for the upscaling of 
abundance to large geographic areas. 

(ii) Detectability: As stated at the outset, for ease of 
presentation throughout this chapter we have 
ignored issues of differences in the detectabil¬ 
ity of species in sites, and particularly the 
frequent under-estimation of levels of occu¬ 
pancy and abundance that result from indi¬ 
viduals being overlooked during field sam¬ 
pling. Increasingly sophisticated methods of 
improving the estimation of both variables are 
available (see Chapter 3). From the perspective 
of the estimation of abundance from occu¬ 
pancy, of particular interest is the modelling of 
detectability probabilities based on time series 
data on occupancy (MacKenzie et al. 2002, 
2003; Royle et al. 2005). Consideration of such 
approaches highlights that the effective eval¬ 
uation both of existing and novel methods of 
predicting abundance may require better esti¬ 
mations of actual occupancy than have in some 
cases thus far been employed. 

(iii) Temporal dynamics: More generally, the con¬ 
tents of this chapter have provided a rea¬ 
sonable reflection of the ecological literature 
at large in focusing primarily on occupancy- 
area relationships, occupancy-abundance rela¬ 
tionships and species occupancy distributions 
based on estimations of the occupancy of a 


species at a given time or averaged over a 
given period, and in the main ignoring the 
temporal dynamics of occupancy (but see Gas¬ 
ton & Lawton 1989; Webb et al. 2007; Heino 
2008; Zuckerberg et al. 2009). However, partic¬ 
ularly in light of the rapid changes in the dis¬ 
tributions of species being wrought as a conse¬ 
quence of anthropogenic pressures (including 
climate change), a fuller understanding of 
these patterns is undoubtedly required. Of 
particular value would be much improved 
knowledge both of how the different patterns 
change through time and of how changes in 
the occupancies of species modify the observed 
patterns. 

(iv) Mechanisms: Among the many factors that can 
potentially affect species occupancy, species 
abundance and the spatial distribution of that 
abundance are the most immediately signifi¬ 
cant. Other factors do so through their effects 
on these variables. Given the numbers of pos¬ 
sible such variables, and the indirect and scale 
dependent nature of their effects, their influ¬ 
ence may often be difficult to detect other 
perhaps than under highly controlled cir¬ 
cumstances (e.g. microcosms; Gaston & War¬ 
ren 1997; Warren & Gaston 1997; Holt et al. 
2002b, 2004a). This may well in large part 
explain the great controversies that have raged 
around the possible influences of niche breadth 
and position and dispersal abilities on lev¬ 
els of occupancy. In consequence, we do not 
expect that a single coherent mechanism will 
be determined that gives rise to occupancy 
patterns. 

11.4 Key points 

1. Occupancy is a fundamental measure of the 
distributions of species in landscapes. It is a 
function of their abundance and spatial dis¬ 
tribution. A locally abundant species typi¬ 
cally has higher global occupancy, and vice 
versa. On the other hand, spatial aggregation 
reduces occupancy. These two general qualita¬ 
tive patterns are described by the occupancy- 
abundance and occupancy-area relationships. 
Although many potential mechanisms give rise 



SPECIES OCCURRENCE AND OCCUPANCY 151 


to these relationships, colonization/extinction 
dynamics play a key role in determining 
occupancy. 

2. Occupancy is scale dependent. The scale not 
only refers to the spatial resolution at which 
a species is mapped, but also to the spa¬ 
tial extent over which the species is studied. 
Because of the effect of scale, the measurement 
and quantification of occupancy can become 
complex. Comparison of occupancies for two 
or more species is irrelevant unless the spa¬ 
tial scales are compatible. Ideally, this should 
be done using occupancy-area curves, which 
describe the change of occupancy across the full 
range of spatial resolutions. 

3. The most important application of the 
occupancy-abundance relationship is to 
estimate species abundance from occupancy. 
Present methods have met with a certain degree 
of success for relatively small spatial extents. 
Their usefulness in estimating abundance at the 
regional scale remains to be tested, although it is 


likely that they will substantially under-estimate 
regional abundance. It will be a major break¬ 
through if methods can be developed to estimate 
abundance from regional occurrence data. 

4. Occupancy is the basic unit underlying several 
macroecological patterns, including species- 
area curves and gradients in (3 diversity. 
Assuming p,(a) is the occupancy probability 
of species i in a site of size a, then the species- 
area curve is simply the sum of probabilities 
across all species. 3 diversity can also be 
formulated by a conditional probability through 
occupancy: given a species is present in an 
area, what is the probability that the same 
species will occur in another area at distance d 
away? 
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CHAPTER 12 


Measuring the spatial structure 
of biodiversity 

Brian J. McGill 


12.1 Introduction 

Tobler (1970) coined the first law of geography: 
'everything is related to everything else, but near 
things are more related than distant things'. Put 
in other words, the second half of Tobler's law 
is 'far things are less related' or just 'everything 
varies across space.' Biodiversity certainly follows 
this law. It is well known that diversity varies lati- 
titudinally with the highest diversity in the trop¬ 
ics (Rosenzweig 1992; Kreft & Jetz 2007), but vari¬ 
ation in biodiversity occurs at all spatial scales 
(Lennon et al. 2001; Rahbek & Graves 2001). The 
abundance of an individual species also varies 
across space (Brown et al. 1995); even whether 
a species is present or absent varies across its 
range (Chapter 11). Of course this should come as 
no surprise because the underlying environmental 
variables such as elevation, temperature, and soil 
properties also vary across space. How can such 
variation in biodiversity across space be quantified 
and understood? 

For a long time the main method was to develop 
a checklist—a list of all species found within 
a geographic entity like a park, county/shire, 
state/province, or country. But this approach is not 
particularly strong in capturing the spatial struc¬ 
ture. In recent times, there have been increasing 
numbers of spatially dense biological surveys, espe¬ 
cially surveys of birds and trees conducted by 
national governments. With the advent of GIS tech¬ 
nology it is possible to display this data in richly 
coloured maps (Kreft & Jetz 2007). This allows 
visual inspection of patterns. This book, however, 
is mainly focused on the quantification of biodi¬ 
versity, which requires going beyond maps. Recent 


decades have seen an explosion of the field of spa¬ 
tial statistics (Cressie 1992; Fortin & Dale 2005). 
Although spatial statistics received much of its 
development in applications to mining (ore con¬ 
centrations measured across space) the tools are 
applicable to a broad range of fields, including soci¬ 
ology and epidemiology. More relevantly, spatial 
statistics has a long history of use in the measure¬ 
ment of the biodiversity and abundance of organ¬ 
isms, which is summarized here. 

Unfortunately, there has been a great prolifera¬ 
tion of spatial statistical techniques, so starting out 
in spatial statistics can be overwhelming. Books 
on spatial statistics, whether targeted at statisti¬ 
cians (Cressie 1992) or ecologists (Perry et al. 2002; 
Fortin & Dale 2005), list dozens of different tech¬ 
niques. There have been a few attempts to find links 
between the methods (e.g. Dale et al. 2002), but 
there has been distressingly little effort to empha¬ 
size the underlying unity of methods to ease the 
learning curve for beginners. In this chapter I will 
adopt the attitude that details of calculations can 
be left to software and emphasize the concep¬ 
tual unity of and the interpretation of results for 
the different methods. To begin to navigate the 
plethora of spatial statistics methods, I lay out three 
main axes of importance: what spatial structure is 
being explored, pattern vs. association, and type 
of data. 

12.1.1 What spatial structure is of interest? 

Modern spatial statistics has converged around the 
model: 

z(s) = fi(s) + v (s) + e 
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where z(s) is the variable of interest (e.g. 
biodiversity, abundance of a target species), q.(s) is 
the average value of z in the area of spatial location 
s, £ is noise or measurement error, which does not 
depend on spatial position and is independent 
(uncorrelated) across space, and r|(s) is variability 
about the mean that is spatially structured (i.e. 
correlated across space). It is common to refer to pi 
as a first-order effect because it looks only at one 
point in space at a time and 77 as a second-order 
effect because it, by definition, depends on two 
points in space at once. I extend this semantics to 
refer to e as a zeroth-order effect since it does not 
depend on space at all. Fig. 12.1 explores what the 
variation in z across space is like with different 
combinations of zeroth-, first-, and second-order 


effects. Conveniently, and not coincidentally, this 
mathematical organization matches nicely with the 
two most common questions one asks about the 
spatial structure of biodiversity. 

Question 1: First-order effects—is there variation 
across space? 

The most basic and most useful question is whether 
there is systematic change in the variable of interest 
across space. For example, are there regions where 
diversity or the abundance of a target species is 
particularly high or particularly low? From a basic 
research point of view this is interesting because 
it raises the question of 'why?' Questions of what 
controls diversity and abundance are central to 
ecology and exploring how this varies across space 


(a) 


(c) 


No effects 



First order effects only 




(b) 


(d) 


(f) 


Zeroth order effects only 



Zeroth & first order effects 



First & second order effects only 



Figure 12.1 Different order effects in spatial modelling, (a) No effects: the measurement variable (possibly abundance of a species, possibly species 
diversity, possibly soil moisture) is entirely constant, (b) Only zeroth-order (measurement error, innate variability or noise) effects. The variable is on 
average constant but there is variability. The variability is completely independent, even between adjacent points, (c) First-order effect: a systematic change 
in the mean of the variable across space. Here a simple linear trend is modelled, but of course if the mean is tracking some underlying variable like soil 
depth the system can look rugged and irregular even with only first-order effects, (d) Combined zeroth- and first-order effects. Here there is variability due 
to zeroth-order effects overlayed on top of the same first-order trend in the mean found in (c). (e) Second-order effects only. The mean is constant across 
space and no measurement error is modelled, but there is autocorrelated variability. When one point is above the mean, adjacent points are more likely 
(although not guaranteed) to also be above the mean. This starts to give rise to coherent features that are spread across several points, like the peak in the 
back corner or the trough running across the front corner, (f) Second-order effects (e) overlayed on top of first-order effects (c). 
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is one of the easiest ways to study these questions. 
From a management point of view, identifying spe¬ 
cific areas of high abundance or diversity can lead 
immediately to management decisions such as the 
location of reserves in biodiversity hotspots (Pren- 
dergast et al. 1993; Reid 1998) or minimizing human 
impact from roads and buildings in regions of high 
abundance for an endangered species. Statistically, 
these questions all map back to questions about 
first-order processes—changes in the intensity of a 
process (the mean value of the variable measured, 
e.g. biodiversity) across space. 

Question 2: Second-order effects—do distant 
points have effects? 

Imagine we have measured the biodiversity at loca¬ 
tion A. Do we attempt to explain the level of bio¬ 
diversity at A solely as a function of exogenous 
variables (temperature, soil, etc.) at A? Or is biodi¬ 
versity at A in part a function of biodiversity else¬ 
where in the landscape? If a nearby location B is 
extraordinarily high in biodiversity might there be a 
spill-over effect (perhaps B serves as a source and A 
as a sink, elevating the diversity at A beyond levels 
the environment there would otherwise support). 
If B is having a spill-over effect, might location C, 
which is twice as far away, also be having a spill¬ 
over effect on A? Exactly how far from A can we get 
and still credibly have a spill-over effect? Similarly, 
for a single species with a given abundance at A, 
is this abundance only a function of the properties 
of location A or might the abundance at site B also 
have an effect? These types of questions are about 
interactions at a distance. In technical terms, they 
are question about second-order effects in spatial 
statistics. When interactions at a distance are posi¬ 
tive they tend to create clumping (Fig. 12.3b), while 
negative interactions tend to create regular distri¬ 
butions (Fig. 12.3c). Interactions can be positive at 
some scales and negative at other scales (Fig. 12.3e 
and f). 
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Figure 12.2 Different types of spatial data, (a) Point data from Barro 
Colorado Island's tropical tree study. This map shows the location of 
individual trees of the species iicania platypus (with a diameter at 
breast height of 1 cm or greater). Elevation contours are plotted as well, 
showing the plant is biased towards slopes (Condit et al. 2000; Harms 
et al. 2001). (b) This plot shows how the species richness of trees varies 
across the Mt St Hilaire nature reserve. The data are collected on a regular 
grid of 50 x 50 m cells, (c) This plot shows how the abundance of an 
individual species of a bird (the scissor-tailed flycatcher) varies across 
space (notice that the highest abundances are in Texas and Oklahoma, but 
the range extends all the way east to the Carolinas). The white dots 
represent routes where the birds were surveyed and are placed irregularly 
in space. Data from the North American Breeding Bird Survey (Robbins 
et al. 1986; Patuxent Wildlife Research Center 2001). 


12.1.2 Number of variables recorded - 
pattern or association? 

As summarized in the last section, to determine 
which spatial statistical techniques to use, we need 
to know whether we are studying first-order or 
second-order effects. We also need to know whether 


one is looking for patterns in one target variable 
of interest (such as species richness or abundance 
of a target species) or if one is recording multi¬ 
ple variables (such as other species or environmen¬ 
tal variables) and attempting to find associations 
between the variables. With multiple variables. 
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finding associations can inform about the causes 
of or lead to predictive models about a variable of 
interest. Thus, one can build a spatially informed 
model of which environmental variables increase 
species richness. Or one can build a predictive 
model about which habitat factors are associated 
with high abundance for an endangered species. 
Such associational analyses have traditionally been 
done in a non-spatial context, treating measure¬ 
ments across space merely as replicates and using 
methods such as regression to find the patterns. 
There is, however, growing awareness that if the 
replicates are across space then are a number of spe¬ 


cial statistical challenges and opportunities. These 
are discussed in section 12.2.3 below. 

12.1.3 Types of data 

Sections 12.1.1 and 12.1.2 lay the main conceptual 
axes along which geospatial statistics are organized. 
However, there is also one practical consideration. 
What type of data is collected? Data on biodiversity 
across space can take many forms (Fig. 12.2, 
Table 12.1). As in other areas of statistics, different 
techniques are required when the data structure 
differs, even if fundamentally similar questions are 


(a) Poisson random 



(b) Neyman-Scott clustered 
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Figure 12.3 Different kinds of second-order (point interaction) 
processes give rise to different spatial patterns, (a) A Poisson random 
process, (b) An example of clustering (attraction between points) 
simulated by the Neyman-Scott process, (c) An example of regular or 
over-dispersion (repulsion between points) simulated by the Mattern 
hard-core process, (d) A process where the locations are Poisson 
random, but the points are marked by two features: size (e.g. tree 
diameter) and black/gray (e.g. species), (e) An example of 
scale-dependent effects: random at small scales, clumped at larger 
scales, and over-dispersed at very large scales, (f) Another example of 
scale-dependent effects which appear regular at small scales but random 
at large scales. 
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(e) Scale-dependent effects 
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Table 12.1 Major categories of spatial data (section 12.1.3). 


Spatial sampling design 

Binary variable 
(present/absent) 

Continuous variable 
(amount/abundance) 

Sampler controlled 

Regular across space 

'Presence/absence quadrat' 

Quadrat 


Random across space 

'Presence/absence geospatial' 

Geospatial 

Biology controlled 


Point-process 

Marked point-process 
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asked. The single most important question is the 
nature of the spatial structuring of the data and is 
described by the dichotomous key below: 

(a) The locations recorded are controlled by nat¬ 
ural events -*■ individual point-process data. 
Examples include the locations of tree trunks 
in forest, bird nests, or lightning strikes 
(Fig. 12.2a). 

(b) The locations are chosen by the scientist in the 
design of the sampling protocol. 

i The locations are spaced regularly across 
space -> quadrat data. Examples include 
quadrats arranged across a line (transects) 
and quadrats arranged across a rectangular 
area. In addition to the regularity of the data 
there is also an assumption that some area 
(line or rectangle) has been exhaustively cov¬ 
ered (Fig. 12.2b). 

ii The locations are spaced irregularly, per¬ 
haps apparently randomly across space -» 
geospatial data. Examples include counts of 
abundance of a target species in quadrats 
where the quadrats are scattered across a 
large area or soil moisture at irregular spac¬ 
ing across an area. The location may be lit¬ 
erally random (a randomization was used in 
the protocol) or may be just irregular, with 
the actual locations driven in part by accessi¬ 
bility to roads and trails, etc. (Fig. 12.2c). 

Note that there is a gradient as one proceeds 
from the top (a, individual point processes) to the 
bottom (b-ii, geospatial data). As one moves down, 
the amount of information decreases, but the ease 
of obtaining the information increases. Type a data 
can be converted to type b-i data, which can be 
converted to type b-ii data but not in the other direc¬ 
tion. 

Although it will not affect the type of analysis 
done, many readers will be aware that the spatial 
scale at which the data are measured can have pro¬ 
found implications for the results found and the 
types of questions that can be asked. The scale of the 
data is usually described by two terms'. The grain is 
the scale of a single measurement. Thus if a single 
measurement is a 1 x 1 m quadrat (as is common in 
grasslands), then the grain is 1 m (or 1 m 2 depend¬ 
ing on if lineal or areal measurements are used). The 


extent is basically the distance between the farthest 
apart measurements. This can range from metres to 
thousands of kilometres. The number of sampling 
units, N, is another key attribute of any survey. 
These three factors, if grain and extent are measured 
in areal units, can be combined to describe the cov¬ 
erage of the study (coverage = N x grain/extent, 
giving the percentage of the study area actu¬ 
ally measured). In point-process and quadrat data 
(types a and b-i above) it is often assumed that 
coverage is 100% while the geospatial data survey 
method is often chosen when the resources avail¬ 
able for survey are inadequate to provide 100% cov¬ 
erage of the area of interest. The results of analyzing 
data can depend heavily on the scales involved (see 
Fig. 12.3e and f, described later for an example). 
There is no such thing as a best scale for grain or 
extent. The goal is that the scales should be chosen 
to match the question at hand. Thus, if one is study¬ 
ing bird dispersal, then the grain should be small 
relative to the dispersal distance but the extent 
should be large relative to the dispersal distance. If 
one is studying tree mortality, which is a rare event, 
then the total area covered (N x grain) needs to be 
large while the extent should match the question 
(is one making an assertion about mortality across 
the geographic range of a species or in a specific 
park?). 

In summary the combination of: (i) two types of 
question (section 12.1.1), (ii) two options for num¬ 
ber of variables recorded (section 12.1.2), and (iii) 
the three types of data (section 12.1.3), from a 2 x 
2x3 grid to determine what types of spatial analy¬ 
sis tools can and should be performed (Table 12.2). 
If one has a spatial dataset but doesn't know what 
types of analyses to perform, one should start by 
determining where in this 2x2x3 grid the data 
and the questions of interest fall. 


12.2 State of the art 

Despite the seeming complexity of 12 cells and an 
implied 12 types of geostatistics, there are really 
only three basic approaches: one must decide if one 
is looking at first-order effects, second-order effects, 
or associations between two variables. From there a 
single conceptual approach will apply, although the 
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Table 12.2 Common spatial statistics applied depending .on the type of spatial data available, the question asked, and the number of variables 
measured. 




Point locations 

Quadrat counts 

Geospatial 

One variable only 

What is intensity 

Grid, kernel smoothing 

Local smoothing 

Local smoothing (with 


(interpolation and 
smoothing) 


Trend surfaces 

interpolation) 




Kriging 

Trend surfaces 

Kriging 


What is effect at 

Ripley's K 

Moran's 1 

Covariograms, etc. 


distance? Are they 
aggregated? 


Mantel 

Spectral 

Mantel 

Two variables 

What controls intensity 

Intensity regression 

Spatial regression 

Spatial regression 

(dependent, 

independent) 

Interactions occur at 

Cross K 

Mantel 

Cross-variogram 


distance 



Mantel 


details of the exact procedure will vary depending 
on the type of data. 


12.2.1 Estimating intensity (first-order 
effects) 

One central goal of spatial analysis (question 1) is 
simply to create a map of intensity across space 
where intensity is the value of the variable of inter¬ 
est such as species richness or target species abun¬ 
dance). This then allows exploration of what fac¬ 
tors cause high vs low intensity and management 
decisions based on areas that are favourable (e.g. 
high intensity of the target organism). Two issues 
arise in creating such maps. First, if coverage is 
less than 100% (i.e. the area sampled is a fraction 
of the area of interest, typically the geospatial data 
case), then interpolation is needed to make a pre¬ 
diction about the areas that are not sampled. Sec¬ 
ond, whether the coverage is 100% or not, there is a 
goal to remove the zeroth-order noise and find the 
underlying 'true' signal. One would hate to locate 
a critical management resource (e.g. extra food) at 
a site to find out it appeared to have a high density 


due to chance on the day the survey was done with 
no unusually high abundance ever observed since. 
This process of removing noise is called smoothing. 
Usually one technique accomplishes both smooth¬ 
ing and, if needed, interpolation. We discuss four 
broad classes of techniques that can be used. 

• Local surfaces (smoothing): Many techniques can 
be grouped together as smoothing (Fig. 12.4b). 
For any point where a prediction is needed, 
the predicted value is a weighted average of 
nearby observed points. If the prediction is to be 
made where an observation was made then only 
smoothing is involved, but if the prediction is for 
where no observation was made then interpola¬ 
tion is also occurring. Typically nearer points are 
given a greater weight, to,-. This can be captured 
in the equation: 

. _ E.'eS W - Z - 

~ V ' 

£i<=s w 

The predicted value z at point p is a weighted 
average of the observed values z, over some sub¬ 
set, S, of all the observed values. The simplest case 
is nearest neighbour interpolation where id,- = 1 
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Figure 12.4 Example output from analyses of data discussed in the text, (a) A trend surface (global smoothing) fitted to the data of Fig. 12.2b. 

Generally, diversity increases from west to east which also matches a transition from active human use to a protected area, (b) The same data 
kernel-smoothed (Gaussian kernel) and plotted in a coloured contour plot. Considerable variation occurs with peaks on the north and east edges of the 
lake (large empty region in the centre), (c) Four types of ‘grams' on a theoretical data set showing the rescalings that occur, (d) Correlograms on 
theoretical data (Fig. 12.1) with zeroth-, first- and second-order effects, and then actual data from Fig. 12.2b. Note that the zeroth-order effect shows 0 
correlation at all distances—there is no spatial interaction. The first-order effect starts with a very high correlation at short distances and goes to a very 
negative correlation at long distances due to the trend surface. The second-order effects start with a fairly high correlation in nearby sites, which drops to a 
negative correlation at the distance at which peaks to valleys are compared and then fades to zero correlation at long distances, indicating no interactions. 
Thus this analysis gives a clear indication of the scales of interaction. The actual data show a low, but positive, correlation at short distances fading out to 
zero correlation at longer distances. 


if the point i is the closest point to p and to,- = 0 
otherwise. Moving average is another technique 
on gridded data where w , = 1 for the cells within 
h cells of the target cell p. A similar approach 
used on geospatial data is the moving window, 
where a box with sides of length h are drawn 
around each point and u;, = 1 if the point i is in 
the box and zero otherwise. Similar techniques 
can be used where points are differentially down¬ 
weighted the further away they are. Exponential 
smoothing, where to, = exp (—hdi p ), where d ip is 
the distance between i and p, is a common choice. 
The most modern and probably best technique 
is known as kernel smoothing, which weights 
the points according to a symmetric probability 
distribution, k(d,h), centred around the point p 
and where ui, = k[\\p — i\\/h]. A two-dimensional 
Guassian bell curve is a common choice for k, so 


Wi = exp (—df p /h 2 ), as is the quartic kernel, uz, = 
3(1 — d 2 /h 2 ) 2 /n when d < h and 0 if d > h. In the 
quartic case any point further than bandwidth h 
away has no effect whereas in the Guassian it has 
an increasingly small effect. Notice that in all of 
these scenarios the specification of a smoothing 
parameter h is required where h is a measure of 
the distance at which effects are still important. 
As h increases more smoothing occurs. In the limit 
when h is very large a completely flat surface 
(Fig. 12.1a) will be produced. 

• Global surfaces (trend surfaces): In contrast to 
smoothing approaches, which are local in nature, 
a trend surface is a global prediction that can 
be summarized by relatively few parameters. A 
trend surface is described by a function: 

z P = y(p)\9) 
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where x(p) and y(p) give the x and y coordinates 
(possibly latitude and longitude) of the point and 
9 is a set of parameters (Fig. 12.4a). The func¬ 
tion / can vary from simple to quite complicated. 
The simplest trend surface to fit is a linear model 
where / is a plane (i.e. z p = + j 8\y + c). Such 
fitting is usually done by a least-squares crite¬ 
rion and is basically just a two-variable regression 
with the independent variables being spatial coor¬ 
dinates. The next step up is to use a polynomial. 
A quadratic polynomial would be z p = ax + bx 2 + 
cy + dy 2 + exy + /. A quadratic polynomial yields 
a three-dimensional parabola and as such can 
only represent a unimodal (one-peaked) surface. 
More commonly cubic or quartic polynomials are 
used. The number of coefficients that need to be 
estimated goes up quickly with degree (linear = 3, 
quadratic = 6, cubic = 10, and generally a polyno¬ 
mial of degree p has (p + 1 )(p + 2)/2 coefficients). 
Care must be used. Fitting a quartic (p = 4) poly¬ 
nomial with only 20 data points would leave only 
5 (20-15) degrees of freedom, which is probably 
too low to be reliable. A variety of functions with 
/ more complex than polynomials can also be 
used. For example, if one is modelling a surface 
that varies periodically, sine or cosine functions 
may be appropriate. 

• Kriging: The first two techniques (local and 
global surfaces) deal only with first-order effects. 
However, second-order effects can also impact the 
predicted values z p . Over very large distances the 
second-order effects should decay to zero and a 
surface based on first-order effects will be unbi¬ 
ased (i.e. be accurate on average). However, on 
smaller scales second-order effects can signifi¬ 
cantly affect the predicted values. Knowing that 
the variable of interest (e.g. abundance) is unusu¬ 
ally high at a point p implies that it will usually 
be unusually high in a neighbourhood around 
p. First-order processes take this into account 
to some degree (small bandwidths h and highly 
flexible global surfaces / are strongly influenced 
by local conditions). However, a second-order 
effect is probably a more accurate way to incor¬ 
porate this information and allows us to make 
inferences about the nature and distances of inter¬ 
actions. The means to do this is called kriging. 
Kriging also has the statistically desirable prop¬ 


erty of being a minimum variance estimate. Krig¬ 
ing is the attempt to produce more accurate pre¬ 
dictions of intensity by combining first-order and 
second-order analyses of the data. Further dis¬ 
cussion of kriging is postponed to section 12.2.2, 
where second-order effects are covered in detail. 

• 'Let it all hang out': One approach is to not 
smooth or interpolate but simply to report the 
data as it exists. Press and colleagues argue for 
just this approach (2007). The human mind is nat¬ 
urally good at interpolation. It is only moderately 
good at smoothing, but certainly it can detect and 
dismiss outliers. 

In comparing these techniques, three things are 
worth noting. First, the global trend surface 
approach is effectively making a prediction for all 
points in the area of interest. The smoothing and 
kriging techniques only make predictions for a 
specified set of points p e P. This set can either 
be a set of target points of interest (e.g. locations 
under consideration for management decisions) or 
it can just be a regularly spaced grid of fairly fine 
resolution placed across the area of interest. Once 
a trend surface or a grid of predicted points is 
in hand, the data can be plotted in a number of 
ways. Three-dimensional surface plots (12.4a) are 
most commonly used for trend surfaces. Contour 
maps are probably the most common way to plot 
smoothed data. In either case colour (or greyscale) 
intensity can be overlayed. 

Secondly, of the four techniques for estimating 
intensity surfaces, the fourth is the only one directly 
applicable to point-process data. To estimate inten¬ 
sity surfaces across space from point-process data 
(e.g. density of tree trunks), two choices are avail¬ 
able. The first is to lay a grid over the data and 
then count the number of points in each grid 
cell, thereby moving from type i data to type ii- 
a. Then any of the four techniques can be used. 
This entails a loss of information and works best 
when the points are fairly dense in space. Sec¬ 
ondly, the local surface method can be modified 
where instead of using z, the observed intensity 
at a point, one uses counts of points (replace z, 
with «, ) and adjust the weights by area so w p = 
E?=i (^V/a P k ( i T H ) du with the intensity 
at a point p basically being a density weighted 
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average of neighbouring points. Neighbour-based 
methods to estimate intensity go back to the early 
days of forestry (Konig 1835), but they are accu¬ 
rate only if complete spatial randomness with no 
second-order effects holds (see Box 12.1) and are not 
generally recommended today (see section 12.2.2 
for a brief description of one of these methods). 

Finally, the first two techniques require the input 
of a smoothing parameter (degree of smoothing). 
The smoothing parameter is implicit as the band¬ 
width, h, in the local smoothing case and implicit 
in the flexibility of the function / (e.g. degree 
of polynomial) in the global surface case. If too 
much smoothing occurs then true variation will 
be eliminated, but if not enough smoothing occurs 
then noise will be represented as pattern. Unfor¬ 
tunately, there is no way to determine the mathe¬ 
matically correct smoothing parameter, despite the 
existence of various guidelines and rules of thumb. 
The smoothing parameter is ultimately a subjec¬ 
tive, human-chosen parameter. The third approach 
(kriging) essentially derives the degree of smooth¬ 
ing from the data itself and must be considered 
to have an advantage for this reason. The final 
approach also avoids a smoothing parameter. Over¬ 
all, kriging has often been avoided due to its per¬ 
ceived complexity, but with software readily avail¬ 
able this argument has lost weight. On the whole 
I would recommend kriging in combination with 
always plotting the 'let it all hang out' approach for 
comparison. 

12.2.2 Studying effects at a distance 
(second-order effects) 

Second-order effects assess how the value mea¬ 
sured (e.g. biodiversity or abundance) at one point 
influences the same property at other points some 
distance away. There are a couple of reasons why 
one might want to study second-order effects. Most 
simply, one might wish to make more accurate pre¬ 
dictions of intensity (i.e. using kriging). A more 
involved reason would be to test whether there 
are effects over distance and whether these effects 
are positive or negative. An even more informative 
question is to look at how these effects vary with 
distance. For example, it is possible that having a 
high abundance at one site makes sites 10-100 m 


away more likely to have a higher than average 
abundance while making sites 0-10 m away more 
likely to have a lower than average abundance. 
This has strong implications for spacing require¬ 
ments, which in turn has implications for minimal 
areas to contain a viable population. From a basic 
research point of view, this then immediately leads 
into research questions about what processes cause 
these effects (dispersal limitation, species interac¬ 
tions, resource competition). Additionally, second- 
order effects are directly related to the goal of mea¬ 
suring whether distributions are clumped or reg¬ 
ular. For example, Gaston and He (Chapter 11) 
describe in detail the study of mean occupancy (per¬ 
centage of patches occupied) but do not analyze the 
spatial structure of the occupied cells, specifically 
whether the occupied cells occur in clumps or not. 
Second-order effects deal with this problem; posi¬ 
tive second-order effects say that knowledge that a 
species is found at one site implies a greater than 
average probability of encountering that species at 
other nearby sites, leading to clumping, while neg¬ 
ative second-order effects imply an unusually even 
spacing (regularity) (see Fig. 12.3). 

• Early approaches: The oldest method for explor¬ 
ing second-order effects is the variance-to-mean- 
ratio used on counts taken in quadrats. This 
test derives from the fact that the variance of 
the Poisson distribution is equal to the mean 
of the distribution, fj .. This is an unusual and 
strong property (compare to the normal dis¬ 
tribution, where the mean and variance are 
completely unrelated). Thus we can examine 
the index of dispersion ID = var(z, )/average(z, ) = 
[E"=i( z - - zD 2 /(w - 1)] / [E,"=i z i! n \ Under the 
null CSR/Poisson assumption (see Box 12.1), 
ID = 1. When ID > 1, then there are more 
quadrats with either very low (near zero) or very 
high (well above average) abundance, which is a 
sign of a clumped distribution (positive or attrac¬ 
tive second-order interactions). Conversely, when 
ID < 1 then most quadrats have z, very close to 
zj, which is a sign of regularity. Conveniently, if 
n > 6 and z7 > 1 then (n — 1) x ID is distributed 
approximately as a chi-square distribution with 
(n — 1) degrees of freedom, which allows for tests 
of whether ID is significantly larger or smaller 



MEASURING THE SPATIAL STRUCTURE OF BIODIVERSITY 161 


Box 12.1 Null models and significance tests 


With the exception of spatial regression (section 12.2.3), 
little has been said about the central concern of most 
statistics: assessing statistical significance. While spatial 
statistics are frequently used in a purely descriptive fashion, 
significance tests can be readily fashioned if one has a clear 
concept of the null model. 

Spatial null models 

The simplest null model is that the variable of interest is 
constant across space (see Fig. 12.1a), but this is not 
particularly useful. Null models need to include statements 
about variability. The traditional spatial null model goes 
under varying names such as 'homogenous Poisson 
process'. The key assumptions of this model (with reference 
back to equation 12.1) are: 

1. First-order: the intensity (expected mean value of the 
target variable) is constant across space or 
homogeneous: /4s) = /*. 

2. Second-order: there are no second-order effects, every 
location is spatially independent of every other location: 
rj(s) = 0. 

3. Zeroth-order: the noise or error is Poisson distributed: 

/x + e ~ Poiss(/zA). 

where /u is average density and A is area. This fully specifies 
a precise mathematical model. Poisson processes have no 
memory—that is to say the probably of an event occurring 
is completely independent of whether an event occurred 
nearby. 

Conveniently this single null model applies to both types 
of data: type a (point process or nature determined 
locations; see Fig. 12.2a) and type b (measurements at 
points or grids; see Fig. 12.2b and c) data. Also 
conveniently, it is trivial to run a computer simulation of 
this null model: 

1) Sample the number of points N from a Poisson 
distribution with parameter Ap (A = area). 

2) Rescale the spatial coordinates to range from 0 to 1 on 
both axes and then sample two random uniform 
numbers over the interval 0-1 (something which nearly 
every programming language provides). 

3) Treat these as the x, y coordinates of the points. 

A final convenient fact is that the probability of a 

non-empty region is given by 

P(k > Q\ixA) = 1 - P(0\fiA) = 1 - e-A 4 . 


The Poisson model gives a good analytical null model. 
When a null (randomization) model on empirical data is 
needed complete spatial randomness (CSR) is used. For 
sampling data (type b data) where a value i is recorded at 
different points (or grid cells), simply reshuffle. Take the 
observed values, z- u and randomly assign them to a different 
point or grid cell, i.e. a shuffling or sampling without 
replacement. This will keep the intensity (/4 constant (point 

1 of the null model) and remove second-order effects (point 

2 of the null model). The noise (-p or point 3 of the model) 
may or may not be Poisson distributed depending on 
whether the original data were, but it is usually considered 
a good thing that the underlying distribution of the data is 
preserved. To simulate CSR from point data (type a data), 
just randomly relocate each point individually (independent 
of the other points). Again assumptions 1 and 2 will be 
met. Thus the Poisson process serves as an analytical null 
model while the closely related CSR serves as a 
Monte-Carlo or randomization null model for empirical 
data. 

Statistical significance on grams and K(d) and 
g(d) statistics 

There are several ways to test the statistical significance of 
a gram (most commonly whether a correlogram is 
statistically significantly different from zero at some 
distance, d). The distribution of Moran's / is known 
analytically (Box 12.2 and Legendre & Legendre 1998), 
allowing for a simple analytical test of significance. More 
generally, a randomization test can be performed. Reshuffle 
the data as described above to convert the data to a 
sample from the CSR model. Calculate the correlogram that 
results. Repeat this, say, 999 times and draw the 95th 
perecentile envelope of these randomizations. Portions of 
the correlogram that are outside the envelope are 
significant. Unless one has a prior hypothesis and is testing 
significance at only one distance bin, any significance test 
on a correlogram faces a problem of 'multiple tests'; if 
there are 20 bins there are 20 tests of significance and the 
probability of making a type I error (believing the null 
model is rejected when it should not be) is high. All the 
classic approaches to multiple tests apply. The Bonferroni 
test (use significance levels of 0.05 In instead of 0.05) is 
used most commonly, but it is overly conservative (Garcia 
2004 and see solutions therein). 


Continued 
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Box 12.1 ( Continued) 

A very similar technique can be used to test for statistical 
significance on the K(d) and g(d) statistics on point data. 
Simulate many replicates of CSR on the data (randomly 
relocate each point) or, equivalently, simulate a Poisson 
process with the observed number of points. Then calculate 
the statistic of interest, say g{d), on each simulation. Then 
the 95th percentile envelopes can be drawn. Places where 
the statistic goes outside the envelope are significant. 

Issues of multiple tests arise as in the tests of 
correlograms. 

. Mantel tests 

A Mantel test (Mantel 1967) is really nothing more than a 
way of calculating statistical significance on the correlation 
between two distance matrices. In one scenario, a Mantel 
test serves as a test for second-order effects. Here, one 
matrix would hold the physical distance 11/ -/|| between 
each pair of points in the appropriate cells of the matrix. 

The other matrix would hold the difference in observed 
values, z-, -Zj. In the other scenario, the Mantel test serves 
as a test of association at a distance between two variables 
similar to cross-kriging (section 12.2.3). Here one matrix 
contains the distance between points in, say, environmental 


than 1. There has been a great deal of criticism 
levelled at the ID (Hurlbert 1990) and it is true 
that ID = 1 does not imply a Poisson process, but 
ID > 1 or ID < 1 does imply clumping or reg¬ 
ularity respectively. A more telling criticism is 
that this measure has no spatial structure (two 
quadrats 1 km apart are compared in the same 
way that two quadrats 10 m apart are), but this is 
such a quick and dirty measure it belongs in the 
toolbox of spatial analysis. The first attempts to 
incorporate spatially explicit measures of second- 
order interactions (where quadrats far apart are 
treated differently to those close together) apply 
only to point-process data and are based on dis¬ 
tance to the nearest point. Diggle's function G(d) 
is defined as the fraction of points p, where 
the nearest neighbour of p, is at a distance less 
than or equal to d. Under the null hypothesis 
of CSR, G{d ) = 1 — e _w2 (see Box 12.1) and an 
empirically observed G(d) can be plotted against 
the null G(d). First-order information (inten¬ 


space (temperature, precipitation, etc.), while the other 
matrix contains the distance between points in, say 
biodiversity, z, — z y (or distance in community structure 
using a measure of dissimilarity like Sorenson or Morisita 
Horn; see Chapter 6). Once one has two distance matrices, 
whatever the exact scenario, the process is the same. Each 
matrix is square with one row and column for each point. 
One first calculates a correlation coefficient, r, on the 
n(n - 1)/2 cells in the matrix above the diagonal. The 
calculation can be a Pearson or Spearman correlation. This 
value r then serves as a measure of association between 
the two distance matrices just as in traditional correlation 
analysis (and ranges from -1 to +1). The key innovation is 
in using a permutation test to test significance, which is 
necessary because of the spatial autocorrelation between 
the sites. Because a site is represented by both a row and a 
column, one can not just randomly reshuffle the entries in a 
matrix, but must simultaneously shuffle rows and columns. 
Under this constraint, many replicates (e.g. 999) of multiple 
permutations of one matrix are done and then a new r 
correlation is calculated between the cells of the two 
matrices for each replicate. The significance of the 
unshuffled r value can be assessed as the percentile of the 
true r vs all of the r values calculated from permuted 
matrices. 


sity) can also be derived from nearest-neighbour 
information under (and only under!) the assump¬ 
tion of CSR, where a maximum likelihood esti¬ 
mate gives A = 4/(ird 2 ) (Diggle 1983). The related 
function F(d) gives the probability that the dis¬ 
tance from a randomly chosen location in the 
area (which will usually not coincide with an 
observed point) to the nearest observed point 
is less than d. The use of these functions and 
related ideas based on nearest-neighbour dis¬ 
tances, such as the Clark-Evans ratio, the Hop¬ 
kins ratio, and the Blyth-Ripley ratio, are los¬ 
ing favour because they are poor estimates of 
second-order effects under some scenarios. Imag¬ 
ine the scenario where every point in a ran¬ 
dom process is replaced by two points a con¬ 
stant, very small, distance apart. The nearest- 
neighbour distance will imply perfect regularity 
(nearest neighbour at a constant distance) even 
though the data are strongly clumped at a slightly 
larger scale and generally give a poor picture 
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of overall second-order effects at greater distances 
(see Fig.12.3e and f). As a result, other than 
the quick and dirty use of the ID, second-order 
effects should be estimated using estimates that 
incorporate all spatial distances, not just nearest 
neighbours. The exact nature of these estimations 
depends on whether the data points are nature- 
controlled (point-process) or human-controlled 
(quadrat and geospatial). 

• 'Grams' and kriging: We cover the approach 
for quadrat/geospatial data first, where there 
are several closely related ways to describe 
second-order effects (Fig. 12.4c and d). The most 
direct is the idea of covariance Cov(z,, z,) = <j, ; = 
\ £"=i(A* ~z)(z / k -z). Note that Cov(z,, z,) = 
Var(z;) = of). Such a definition assumes that 
we have multiple measures of z,(fc = l...n) 
at each spatial location, which is rarely the 
case (although it could happen if, for exam¬ 
ple, we take repeated measures over time). 
This definition is also extremely complex in 
that a covariance is calculated for each pair 
of points. Both of these problems can be 
fixed by studying C(d) = cr ;/ (d) = A E|;-/n=i( z ; “ 
z)(z ; — z). Here we assume covariance depends 
only on the distance, d, between two points \\i — 
j || and calculate the covariance between all n, ; 
pairs of points that are distance d apart. On grid- 
ded data and for discrete distances there are likely 
to be many points that are exactly d units apart for 
d = 1,2.On geospatial data, where d is con¬ 

tinuous, it is unlikely to find two pairs of points 
that are exactly the same distance apart. In this 
case it is common to group data into bins and 
take all points that are between d and d + bd units 
apart. This binning is similar to what occurs in the 
creation of a histogram. A rule of thumb (Rossi 
et al. 1992) is that each bin should have 30-50 data 
points. Moreover d should typically only go up to 
half the maximum distance between points (when 
d is close to the maximum observed distance one 
is only comparing edges to edges, which may 
not be representative). A plot of C{d) vs d is 
called a covariogram. According to the first law of 
geography, the covariance should decrease with 
increasing distance, eventually reaching zero, and 
this is in fact what is observed. Typically C(0) = 
ct 2 (the variance in the data) and this decreases 


in an exponential- or hyperbolic-like fashion to 
C(oo) = 0. Certain circumstances can also cause 
negative covariances. For example, if the data 
form a peak (e.g. Fig. 12. le) then distances that 
match the distance from peak to the valley around 
it will actually have a negative covariance. Sev¬ 
eral closely related plots can also be calculated 
(see Fig. 12.4c). A correlogram p(d) is simply the 
covariogram C (d) rescaled to vary between -1 
and 1, just as a correlation coefficient does (the 
rescaling is based on the fact that Pearsons r = 
cr, ; /[a,oy]). A variogram analyzes not the covari¬ 
ance of two points but the variance of the dif¬ 
ferences of two points as V(d) = E [(z, — zj) 2 ] = 
A £| |HM (z — Zi) 2 with the same notation as for 
the covariogram. A factor of two appears because 
the pair i, j is counted twice in z, — z, and z ; — z,. 
Thus most often the semivariogram defined as 
y(d) = V(d)/ 2 is more convenient to analyze (so 
for example y(oc) = o 2 while V(oo) = 2a 2 ), but is 
often confusingly just called the variogram. The 
semivariogram is related to the covariogram as 
y(d) = o 2 — C(d) (although in practice estimates of 
y(d) and C(d) do not fit this relationship, for exam¬ 
ple y(0) is rarely 0 as this formula would require). 
In general, if the data violate the assumptions of 
stationarity (discussed in Box 12.2) the equiva¬ 
lences begin to breakdown. Thus all four 'grams' 
are related linearly (I use 'grams' as a generic 
term to refer to the set of the four different types 
of plots: C(d), p(d), V(d), and y(d)). In summary, 
correlograms and its relatives provide a simple 
graph that enables one to determine at what spa¬ 
tial scales the variable of interest is interacting (the 
regions of d where the highest or lowest regions of 
the graph occur).They also allow for determina¬ 
tion of whether the effects are positive or negative 
(the sign of the graph). Kriging, which is a spa¬ 
tial form of smoothing, makes use of 'grams' to 
determine how to weight the points. In a certain 
technical sense (minimum variance) this is the 
optimal weighting. See Box 12.2 for more details 
on kriging. 

• Second-order processes in point-process data: 
Earlier in the chapter a fundamental distinction 
between data where the point locations were 
driven by nature and where they were chosen by 
the human was set out. The 'gram' analyses in the 
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Box 12.2 More details on grams 


Because correiograms and related 'gram' techniques are so 
important (arguably the single most important tool in 
spatial ecology), more details are given here for the 
interested reader. 

Correiograms or semivariograms? 

The main difference between the 'grams' is that the 
covariogram and correlogram isolate interactions only at 
distance d, while the (semi)variogram describes an 
accumulation of effects up to distance d. In this sense the 
difference is conceptually similar (although not 
mathematically equivalent to) the difference in probability 
theory between a probability theory density function (PDF) 
and a cumulative density function (CDF). The strengths and 
weaknesses of the different approaches come from this 
fact. I find the correlogram the easiest to interpret. If the 
line is above 0 at distance d then there are positive 
(attractive) second-order effects at distance d and similarly 
for negative effects. This is less obvious from a 
(semi)variogram. All regions d of the variogram are positive; 
where the slope is steep represents high positive correlation 
while a shallow slope represents low correlation. In other 
words, a high variance at distance d,V(d), may be due to 
processes causing high variability neard = 0 or to 
processes causing high variability near to d or at any 
distance in between. Conversely, however, estimates of the 
variogram are generally considered more robust to outliers 
and hence more accurate. This is largely because the 
first-order process for a variogram, £(z, - z ; ), must equal 
zero, but the first-order process for a covariogram, £(z,), is 
not generally zero and must be estimated. 

Grams on gridded/lattice data 

The use of 'grams' has come from the world of geospatial 
data where points are random, continuous distances apart. 
However, they carry over with little modification to gridded 
data. Indeed, gridded data can make their application 
easier in practice. For continuous data one must find an 
appropriate bin size 8 so that enough points occur in each 
bin. In gridded data, this is not a problem. The natural 
distance of the grid can be used, where it is guaranteed 
that there will be many pairs of points that are one cell 
length apart, two cell lengths apart, etc. Early analyses of 


gridded data ignored most diagonal pairs (e.g. a cell three 
cells North and two cells East), but if each grid cell is 
located at its centre these distances can also be calculated 
and the bins can be centred around discrete cell lengths as 
in 0.5-1.5, 1.5-2.5, 2.5-3.5, etc. cell lengths. Prior to the 
dominance of the ’gram' format, the Moran's / and Geary's 
C statistics were used. These were typically applied to 
gridded data and used only on non-diagonal distances. 
Interestingly, aside from being designed for gridded data, 
Moran's / corresponds exactly to the correlogram and 
Geary's C corresponds to the variogram divided by ct 2 
(Fig. 12.5). Moran's/and Geary's C are also often 
commonly applied to irregular polygon data such as when a 
variable is known for each county or shire. In these cases 
an adjacency matrix W is developed that gives the distance 
between each pair of polygons. Moran's / and Geary's C 
can then be generalized to use W instead of the classic 
discrete cell distances. However, in the end all these 
methods directly relate to the correlogram and variogram. 
They are treated here as conceptually identical with the 
difference occurring only in the practicality of the 
calculations (which can be relegated to software). One 
minor difference that is important is that, unlike the 
correlogram, Moran's / can go outside of the interval 
(-1,1) and for no correlation (i.e. CSR) has 
/ = - 1/(n - 1) instead of / = 0, where n is the 
number of observations. As n gets large the 
difference disappears. 

Describing the shape of a gram 

Aside from visual inspection of a correlogram or 
semivariogram, several analyses can be done. Variograms 
are often described by three measures: y(0) is called the 
nugget (this is theoretically zero but empirically usually 
not), the asymptote, y(oo) = cr 2 is called the sill, and the 
distance d at which the sill is hit is called the range. To 
approximately describe shape over the entire range of 
distances, one can use a model of the form y(d) = f(d\9) or 
p(d) = G(d\d), where 6 is a set of parameters that can be 
fit to the empirically estimated grams. This can smooth out 
some of the noise in estimation. It also allows comparison 
of the parameters between different 'grams'. The 
semivariogram is an increasing, decelerating, asymptoting 
function and is typically fit by the spherical 
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model f(d\r) = a 2 [3d/2r — d 3 /(2r d )\ if d < r or <r 2 if 
d > r, by the exponential f{d\r) = a 2 (1 - e~ d/r ), or the 
Gaussian f{d\r) = c> 2 (l — exp (— d 2 fr 2 )^. All three 

models pass through ( 0 , 0 ) and asymptote at (oo, cr 2 ) with 
one scale parameter r. Offsets can be added to allow for a 
sill (e.g. f(d\r) = a + (cr 2 - a )(1 - e~ d,r ) for the 
exponential. The correlogram is typically fit by simple 
exponential decay G(d\r) = exp (—d/r) and the 
covariogram by G(d\r, cr 2 ) = cr 2 exp (-d/r). 


Kriging (interpolation and smoothing with 
grams) 

Kriging, mentioned in section 12.2.2 as a means of making 
predictions of intensity that incorporate both first- and 
second-order effects, builds directly on the fact that the 
covariogram can be described by only the two parameters, 
a 2 and scale r. in the local smoothing methods of 
section 12.2.3 the question arose of how much weight to 
give to points at a distance d away, and the only answer 
was to choose an arbitrary smoothing or bandwidth 
parameter h. The covariogram and a fitted model 
G(d\r, cr 2 ) provide an obvious answer: weight points that 
are a distance d away by the amount G(d\r, cr 2 )! The 
empirical covariogram curve contains many degrees of 
freedom and could not be practically estimated, but if 
we fit a model and boil it down to two parameters, 
we only lose two degrees of freedom and now get an 
empirical estimate of how best to incorporate second-order 
effects. 

We can now return to equation 12.1 
(z(s) = /n(s) + 77 (s) + s) where fj. is some surface (e.g. 
linear) of spatial location and £ is a function only of the 
distance between two points and is described completely by 
the covariogram and its two parameters (77 is assumed to 
be distributed normally with mean zero and variance r 2 , 
just as in traditional regression). Equation 12.1 essentially 
now conforms to the generalized least squares (GLS) or 
mixed model forms of regression. Although not well 
known, these are standard statistical techniques, and 
methods for fitting them are readily available (typically 
involving some iterative solutions and likelihood methods). 
In practice the semivariogram (which is more stable) is used 
rather than the covariogram. Conceptually, this is all there 
is to kriging. The details are left to the software. Several 
flavours of kriging exist. Simple kriging assumes 
n(s) = 0 (or that /x is known instead of estimated) and 


is not too common. Ordinary kriging assumes fj.(s) = c with 
c to be estimated and requires that the user ensure that 
there are no trends in the data (or that the trends 
be removed and residuals analyzed; see next paragraph). 
Universal kriging models the full equation 12.1 with 
first-order and second-order effects treated 
simultaneously. 

The key assumptions: stationarity 
and isotropy 

The assumption made throughout this section on 'grams' 
(that second-order effects - i.e. covariance - depends only 
on d) is in fact an assumption that may be untrue. One 
common violation is when covariance depends on direction. 
This could occur if dispersal limitation is a major driver of 
second-order effects and dispersal depends on prevailing 
winds or stream currents. When covariance is independent 
of direction we call it isotropic and we call it anisotropic if 
covariance is a function of direction. Another violation is 
when the covariance depends on location. This could occur 
in the previous scenario if the strength of prevailing winds 
or currents varies significantly across space. A full definition 
of stationarity (the usual assumption in spatial statistics) is 
that the mean is constant (expected value £(z,) = ^ 
independent of location /) and that the covariance c# 
depends only on the distance between / and j. The 
condition on the mean is probably more commonly violated 
than the condition on the covariance. For example, a simple 
gradient or trend in z,- across space violates stationarity. It is 
common to fit a trend surface to the data and then to 
analyze the second-order structure (covariance) on the 
residuals, thereby removing the trend. On gridded data, 
differencing z' = z M - z ; will also remove a trend. With 
the exception of some analyses that deal with anisotropy, 
the covariance condition for stationarity is almost always 
assumed to be true. If worried, this can be checked using 
local autocorrelation statistics. Local autocorrelation 
statistics (Anselin 1995) calculate a measure of correlation 
for each point just with nearby points but do not average 
this correlation together with the neighbour correlations of 
other points (as is done in a global correlogram). The result 
can be plotted as an intensity of local correlation. If this 
varies drastically across space, then the assumption of 
stationarity may not be justified. If a particular process is 
hypothesized (such as dispersal limitation) to cause the 
covariance structure, then local autocorrelation statistics 
can be an indicator of the varying strength of that process 
across space. 
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previous section are the dominant tool for study¬ 
ing second-order effects in human-driven data. 
They look at differences in z as a function of dis¬ 
tance between the points (d, ; ). In point-process 
data, we look instead at the number of points 
within a neighbourhood of distance d. The F and 
G statistics mentioned earlier are examples of this 
but in their focus on nearest neighbours can be 
dominated by smaller scale second-order effects. 
The superior approach is known as Ripley's K. 
This method estimates the average number of 
points with a neighbourhood of radius d around 
the points in the point process. This is then nor¬ 
malized by dividing by the intensity. By explor¬ 
ing various radii around a point, it avoids the 
problems of nearest-neighbour statistics. K (d) is 
simple to estimate. If N(d, p,) is the number of 
points in a radius d around point p, (not including 
the point p, ), then 

K(d) = — Y] N(d, p^ L(d) = d-J — 

tifA V 77 

Under the Poisson null hypothesis (Box 12.2), the 
average number of points in an area A is pA and 
for a circle of radius d it is pnd 2 . K(d) = the aver¬ 
age number of points/ p = vd 2 . This fact is used to 
create the closely related statistic L, where L(d) = 
0 for all values of d under CSR. When L(d) > 0 
then the points are regular at scale d and clumped 
if L(d) < 0. Unfortunately there is not a strong 
convention for L(d) and some people use L*(d) = 
—L(d), so it is important to read carefully when 
looking at L(d) diagrams. Conceptually, K(d) and 
L(d) are similar to the semivariogram and the cor- 
relogram (Fig. 12.5). K(d) and semivariograms are 
cumulative and monotonicaUy increasing. Correlo- 
grams and L(d) are referenced relative to zero (with 
CSR giving a flat line at zero). Locations where 
the correlogram or L{d) are noticeably above or 
below the line (equivalently K(d) and semivari¬ 
ogram change slope sharply) indicate clumping or 
regularity at those scales. I prefer an ecologically 
less-well-known alternative to L(d) that is common 
in physics and known as the pair correlation func¬ 
tion g(d) = K'(d)/{2-nd) (Wiegand & Moloney 2004). 
Thus K(d) is a cumulative density function and 
g(d), being the rescaled derivative to it is related 
to the probability density function. The rescaling 


ensures that g(d) = 1 under CSR. Thus, from an 
interpretation point of view g(d) plays a very sim¬ 
ilar role to L(d) and correlograms but it is shifted 
up to g(d) = 1 instead of L(d) = 0 or p(d) = 0. So 
g(d) > 1 equates to clustered patterns and g(d) < 1 
relates to regular patterns. I prefer g(d) because L(d) 
is derived from K(d) under the assumption of CSR 
while g(d ) is derived from K (d) more generally with 
no assumption of pattern. Perhaps the best-known 
application of g(d) to ecology is by Condit and 
colleagues (2000), who looked at the spatial aggre¬ 
gation of tropical trees; they called their function 
omega, Si(d), but this is just a rescaled version of 
g(d), which has much longer precedence outside of 
ecology. 

12.2.3 Associations between two variables 

Section 12.2.1 dealt with first-order effects with 
one variable and section 12.2.2 dealt with second- 
order-effects with just one variable. Of course it 
is very common to have more than one variable. 
For example, one might wish to explore how the 
abundance of one species covaries with the abun¬ 
dance of another species or whether large-ranged 
species covary with a different set of environmental 
variables than small-ranged species (Jetz & Rahbek 
2002). Several techniques are available to address 
such questions. 

Cross-correlograms and other 'cross-' (second-order) 
associations 

Nearly all of the second-order techniques already 
identified in section 12.2.2 can be applied to two 
variable problems. So, for example, where a covari- 
ogram looks at the average value of (x, — x)(x ; — x), 
the cross-covariogram looks at the average value 
of (y> — y)(xj — x) (where y and x are two different 
variables and and i and j are still two different 
points). This is called a cross-covariogram. Cross- 
correlograms, cross-semivariograms, and cross- 
K (d) and cross -g(d) statistics can also be calculated. 
These statistics reveal the second-order interactions 
between two variables, for example does high soil 
moisture nearby at point B make high species rich¬ 
ness more (positive correlogram) or less (negative 
correlogram) likely at point A. Cross-correlograms 
can also be used to leverage information from vari- 
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Nature located 


Human located 



Figure 12.5 The relationships between the most common second-order spatial statistics. In almost all cases, the conversions are simply a matter of 
linear rescaling (g(d) = a x f(d)+b) or moving between a cumulative vs a local form. Minor adaptations are also needed when moving from continuous 
space (geospatial data) to discrete space (gridded data) or from two dimensions to one dimension. In this figure, d = distance between two points, 

A = intensity (average number of events per area), W = aconnection matrix (on gridded data or on areal data such as a map of states), and 
a 2 = Varizj) the variance of the z value across all points in space. The most commonly used technique for each type of data is in bold print. The phrase 
'grid + d -*■ W' indicates that the data, instead of being located continuously in space, is located on a grid and consequently the idea of a distance, d, 
between two observations is replaced by a notion of adjacency, summarized in the matrix W. Note that there are no arrows connecting point data to 
geospatial data, indicating that the match is conceptual rather than mathematical. 


ables that are measured at fewer locations than 
the target variable of interest in the form of cross- 
kriging. These models are all analogous to tradi¬ 
tional correlation in that there is symmetry between 
the variables involved. 

Spatial regression (first-order associations and possibly 
second-order associations) 

If a more quantitatively predictive model is 
required and there is a presumption about which 
variables are dependent and independent, then 
regression techniques are more appropriate. Imag¬ 
ine trying to predict species richness (S) as a func¬ 
tion of temperature (t) and tree height ( h ), using 
measurements of all three variables at many sites 
across space. The simplest approach is a clas¬ 
sic multivariate regression S,- = ft + ft x f,- + ft x 
hi + s, for i = 1... n fit via ordinary least squares 
(OLS). The problem with this approach is that a 


key assumption is that the errors are independent 
(Cov(ei, ej) = 0). This is almost never true in a 
spatial context and would technically count as 
pseudoreplication (Hurlbert 1984). Aside from this 
worry, we might actively wish to incorporate first- 
order effects or second-order effects. Further com¬ 
plicating the picture is that the second-order effects 
might come in as an effect of S, on S, (spill-over 
effects, possibly due to dispersal), f ( on f, (tem¬ 
perature is nearly always autocorrelated), or by 
effects of Sj on e, (autocorrelation of errors). At the 
extreme we could imagine setting up a regression 
such as: 

Si = ft + ft X t{ T- ft x hj + yIVS 4- aVt 

+[*> y\l + v + Si (12.2) 

where W is a matrix giving spatial distance derived 
weights (and with Os on the diagonal so that S, does 
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Table 12.3 A summary of symbols used. 


Ms) 

Ms) 

E 

Zi.z(s) 

w< 

k(d,h) 

h 

n 

lli-Jlld/j 

d 

ID 

6(d) 

C(d), Md) 

W), y(d) 
m,L(d),g(d) 

me\ cm 
m. m 

CSR 

f[] 


Intensity: the average value (first-order effect) 

Covariance or effect at a distance (second-order effect) 

Uncorrelated error 

A measured value (e.g. density, temperature) at point / or location s 
A weighting for point i 

Kernel (probability density function) used in kernel smoothing 

Coefficients in regression models (fi, £) or parameters (6) in non-linear functions 

The smoothing or bandwidth parameter (bigger h smooths over larger areas) 

The number of points 

The distance between points i and j 

Distance 

Index of dispersion 

Diggle's function for point processes 

Covariogram and correlogram, respectively: second-order effects for geospatial data 

Variogram and semivariogram, respectively; cumulative second-order effects for geospatial data 

Ripley's K for point processes, L(d) is the null for K(d). g(d) is the pair-correlation function 

Functions fit to (semi)variogram and covariogram/correlogram, respectively 

Moran's / and Geary's C functions analogous to grams on lattice data 

Complete spatial randomness: a null model for spatial processes 

Expected value (mean) 

Standard deviation and variance 


not depend on S, ) and y indicates the strength of 
such effects (i.e. is a coefficient to be estimated), 
similarly for V as a spatial weight matrix and a as 
a coefficient, [x,y] is a matrix of the spatial coor¬ 
dinates of the points with £ being coefficients (i.e. 
fitting a linear trend surface) and -rj a random term 
with E(g) = 0 and Var(rj) being a covariance matrix 
of second-order effects (possibly derived from a 
model of a covariogram). The term [x, y]£ incoro- 
porates first-order effects (and presumably is cap¬ 
turing the effects of environmental variables that 
were not measured and used in the model). The 
terms yWS and aVt capture the second-order effects 
from the dependent and independent variables and 
are collectively known as 'lag' (derived from time- 
series lags) or autocovariate terms while rj cap¬ 
tures second-order effects as an 'error' term and by 
depending on a covariance matrix is closely related 
to the 'gram' techniques above. To my knowledge 
no one has ever been crazy enough to try and fit 
the entire model given above, but a variety of tech¬ 
niques have been proposed which fit some subset 
of the model. Such regression techniques that use 
some subset of equation 12.2 are known as spa¬ 


tial regression and, being explicitly aware of spatial 
relationships, are designed to address the shortcom¬ 
ings of OLS on spatial data. A good review of var¬ 
ious spatial regression models is by Dormann and 
colleagues (2007). 

Very often when a field sees a new technique 
introduced, there is a period where methods 
proliferate and become increasingly confusing 
and then finally a consolidation phase is reached 
where the strengths and weaknesses are identified 
and the toolkit is narrowed back down to one or 
two techniques. The study of spatial regression (at 
least in ecology) appears to be just entering this 
consolidation phase with a robust debate occurring 
(Lennon et al. 2000; Jetz & Rahbek 2002; Diniz 
et al. 2003; Dormann 2007a,b; Hawkins et al. 2007; 
Kissling & Carl 2008). Although it is still early, I will 
provide my personal recommendation. First, there 
is growing evidence that the original non-spatial 
OLS may not be so bad. A regression produces three 
results, estimates of the coefficients ,S, a measure of 
fit ( r 2 ), and a measure of the significance (chance 
of the null hypothesis of /3 = 0 being true). Theory 
predicts, and studies have shown (Dormann 2007b; 
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Hawkins et al. 2007; Kissling & Carl 2008), that 
even in the face of spatial autocorrelation the 
estimates of /3 are unbiased (on average correct), 
almost as efficient (i.e. standard errors of /S only 
slightly larger) as spatial regression estimates, and 
the r 2 for OLS is lower than in spatial regression 
but spatial regression has more parameters. The 
only real problem with OLS is that the p values are 
very wrong (much more type I error than reported 
by the p value). If one only cares about prediction 
and not testing then this is not a problem. If one 
cares about testing, then there are two choices. 
Simulations have shown that the p value can be 
very accurately corrected using DutellieTs method, 
which adjusts the degrees of freedom in the 
model downwards depending on the amount of 
autocorrelation (more degrees of freedom lost with 
high autocorrelation) (Dutilleul et al. 1993; Dale & 
Fortin 2002; Fortin & Dale 2005). The other choice to 
get accurate p values is to use a spatial regression 
model. Which one to use? There is growing 
evidence that lag-based models can be biased 
and should not be used (Dormann 2007a). This 
leaves the error approach, where 77 is incorporated. 
This has the added advantage of boiling down 
to the already known general least squares (GLS) 
or the slightly more general linear mixed model 
(LMM) approach and the use of covariograms 
to describe 17 . A further advantage is that these 
techniques readily generalize to binomial (logistic) 
and Poisson regression. So my recommendation is 
to (i) run an OLS, (ii) calculate a correlogram on 
the residuals y, — y,, (iii) if the correlogram shows 
significant autocorrelation in the residuals then do 
not use the p values from the OLS, and (iv) if you 
need p values then use spatial regression based 
on errors (GLS/LMM) if and only if second-order 
effects are of interest to you but otherwise just use 
DutellieTs correction to the OLS p values. This 
recommendation will be enormously controversial 
but it is my best read of current evidence combined 
with an inclination to stick with tried and true 
methods unless there is a compelling reason 
not to. 

Variance partitioning 

As one might imagine, interpreting all of the 
coefficients in the regression of equation 12 . 2 , even 


if only a subset of it is used, can be overwhelming. 
A useful tool is variance partitioning (Borcard et al. 
1992). Variables can be lumped into groups (e.g. 
environmental and spatial), and then the amount of 
variation explained by each group can be assessed. 
Of course the environmental and spatial variables 
are collinear (correlated with each other) and 
regression does not have the ability to resolve 
this. The result is that a percentage variance is 
assigned to each of four categories: environmental 
only, spatial only, environmental and spatial 
combined, and unexplained. Together these 
percentages sum to one (the sum of the first three 
is the r 2 of the environmental + spatial model). 
Calculating these numbers is easy. Run three 
regressions (environment + spatial, environment 
only, and spatial only) and note the r 2 . Then 
%environment only = (r 2 nvironment+spatial - r 2 patial ), 
%spatial only = (r 2 nvironment+spatial - r 2 nvironment ), 

combined = 


% 


spatial 

and 


%environment/spatial 
^environment+spatial “ %envir 0 nment Only ■ 

Only — ^environment ^spatial ^environment+spatial 

%unexplained = 1 - r 2 nvironment+spatial . Very often the 
combined environment/spatial variance explained 
is much larger than either factor alone, which 
is disappointing as this is the least informative 
category. It is not surprising, but worth noting, 
that the proportions assigned to each category 
can depend on how many variables are used in 
each category. Dozens of environmental variables 
and just x — y spatial coordinates biases toward 
environment explaining a higher proportion, 
and using a few environment variables while 
using a complex representation of space (e.g. 
PCNM; Dray et al. 2006) can tilt things the other 
way (Jones et al. 2008). The variance partitioning 
approach just described treats each factor as 
conceptually equal. An alternative approach can 
treat one factor as having precedence. In this 
case if we give environment logical primacy 
over spatial, then %environment = r 2 nvironment and 

/oSpatial — Anvironment+spatial ^environment" This W3S 

done for example by Lichstein and colleagues 
(Lichstein et al. 2003) and also done implicitly 
when using regression on residuals (e.g. Wilcox 
1978). There is considerable controversy again over 
which is the right approach. Ultimately, however, it 
is not a question answered by the mathematics, it 
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is a matter of assumptions and appropriateness for 
the question at hand. 

12.2.4 Software available 

I have consciously chosen to emphasize the concep¬ 
tual unity of spatial statistics. However, in doing 
this I have swept under the rug a large number of 
issues. The practical calculations can depend heav¬ 
ily on the exact type of data used. Moreover, I have 
completely ignored the issue of edge effects (the fact 
that some points are near the 'edge of the earth' or 
at least the edge of the sampled data raises compli¬ 
cations for many methods). There are several meth¬ 
ods for dealing with edge effects but they compli¬ 
cate the calculations. Finally, statistical significance 
tests are most often done by randomization meth¬ 
ods, which are not conceptually difficult but require 
additional computer code. For all of these reasons, 
I strongly recommend using off-the-shelf software 
for spatial analysis rather than implementing your 
own methods. Two excellent and free pieces of soft¬ 
ware are available for immediate download that 
handle all of the methods discussed herein. The first 
is spatial analysis of macroecological (SAM) data 
(Rangel et al. 2006). This is a custom-built software 
package targeted at spatial analysis of ecological 
data. It contains an easy-to-use interface and an 
impressive list of spatial methods. The more generic 
(and harder to use) package is R, a general pur¬ 
pose statistical package (R Development Core Team 
2005) with the addition of readily available pack¬ 
ages (libraries) in R, including spatial, spatstat and 
splanc (for point processes), gstat (for kriging), and 
spdep (for gridded data) (Bivand et al. 2008). 

12.3 Prospectus 

At this point in time spatial statistics probably 
needs to enter a consolidation phase where the 
emphasis is on simplification, rejection of outdated 
techniques, highlighting the underlying unity of 
methods, and an effort to streamline the communi¬ 
cation of these methods. In practice, the scientific 
community is not well incentivized to do this and it 
may not happen. Additional development in some 
areas is needed, such as improved methods of test¬ 
ing statistical significance and better understanding 


of the strengths of different forms of spatial regres¬ 
sion, but these are relatively minor. 

There are three new techniques that I believe 
deserve highlighting. First, is a technique common 
in the soils literature but that I have seen applied 
only once in ecology (Kendrick et al. 2008), based on 
nested models of variograms (also called coregion- 
alisation). Basically the variogram is fit using piece- 
wise regression. This automated breakout of scales 
immediately suggests (but does not require) distinct 
processes at these different scales and assigns vari¬ 
ance components to make statements about which 
scales are most important. Unfortunately I am not 
aware of software commonly used in ecology that 
performs this test. Another promising technique 
is geographically weighted regression. This model 
is basically a regression on spatial points but the 
coefficients (i.e. the relative importance of different 
independent variables) is allowed to change across 
space (Fotheringham et al. 2002; Wimberly et al. 
2008). In one example, Wimberly and colleagues 
(2008) found that climate limited tick abundance in 
the eastern USA but landscape structure was limit¬ 
ing in the western portions of the range. Instead of 
treating non-stationarity as a nuisance, this method 
embraces and measures it. Finally, an increasingly 
promising alternative to spatial regression and the 
exploration of causal factors of autocorrelation is 
the development of process-based models. Houch- 
mandzadeh (2008) recently developed a model that 
predicts the pair correlation function g{d) under 
the assumption of simple neutral (diffusive) dis¬ 
persal. Stronger models of how a species dis¬ 
tribution is affected by the environmental con¬ 
text will ultimately allow for the teasing apart of 
causality. 

For the actual application of spatial statistics to 
ecological data, I believe we are entering an exciting 
time where software tools can bury the details and 
let the users focus on interpreting and learning from 
their data without having to climb a mountain of 
technical details first. 

12.4 Key points 

1. Spatial statistics is broadly organized around 

zeroth-order, first-order and second-order 

effects. 
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2. There is a fundamental distinction between data 
in which the spatial locations are human-chosen 
versus nature-chosen. Differences within the 
human-chosen category (e.g. quadrats vs tran¬ 
sects vs geospatial data) have important impli¬ 
cations for how calculations are performed but 
conceptually are of little importance and have 
been exaggerated. 

3. Many methods that have been quite popular are 
now outdated (e.g. F and G functions, variance- 
to-mean-ratio/ID method, moving average and 
exponential smoothing) with better alternatives 
available. 

4. Covariograms/kriging/correlograms are proba¬ 
bly the single central concept today and users 
should familiarize themselves with their use and 
interpretation (although not their precise calcu¬ 
lations). Moran's / and Geary's C are members 
of this set. 


5. The most modern methods can be extremely dif¬ 
ficult to calculate by hand and naive implemen¬ 
tations in software are likely to be wrong, but for¬ 
tunately good software is readily available and 
should be used. 
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Alternative measures of diversity 




CHAPTER 13 


A primer of trait and 
functional diversity 

Evan Weiher 


13.1 Introduction 

13.1.1 General definitions 

Trait diversity is synonymous with functional 
diversity and is defined as the degree to which 
coexisting species vary in terms of their functional 
traits. Functional traits are observable or oper¬ 
ationally defined phenotypic characteristics that 
influence species performance and/or ecosystem 
processes (Grime 1973b; Ricklefs & Travis 1980; 
Diaz & Cabido 2001; Poff et al. 2006). The principal 
difference between trait diversity and taxonomic- 
based species diversity is that trait diversity empha¬ 
sizes the phenotypic differences among taxa while 
discounting phylogenetic relatedness. Even so, trait 
diversity is closely allied with phylogenetic diver¬ 
sity (Chapter 14) because functional traits are often 
phylogenetically conserved and because the two 
approaches use highly similar numerical methods. 

Some have suggested that the term 'functional 
diversity' be restricted to measures of trait diver¬ 
sity that affect the functions of ecosystems (Tilman 
2001; Petchey & Gaston 2006). We should be wary of 
unnecessarily restrictive definitions for terms that 
are conceptual, general, and useful (Weiher 2004). 
In this case, functional traits are widely recognized 
as having importance for species performance and 
it is largely for this reason that they also affect 
ecosystem functioning (Chapin III et al. 2000; Diaz 
& Cabido 2001; Gamier et al. 2006). It may not be 
necessary therefore to mark a distinction between 
trait and functional diversity. However, if the traits 
used to calculate an index are known to have no 
bearing on species performance or ecosystem func¬ 
tion, then the more generic trait diversity should be 
used. 


13.1.2 General importance 

There are two key conceptual areas for which trait 
diversity has particular importance in ecology. The 
first area is in community ecology and deals with 
how local communities are filtered from the larger 
regional species pool and how this manifests itself 
in the dispersion of traits across species. The term 
'trait dispersion' has been used in community ecol¬ 
ogy to parallel the study of spatial dispersion pat¬ 
terns in which competition or other processes can 
lead to spatial over-dispersion, where individuals 
are more separated from each other than would 
expected by chance (e.g. Greig-Smith 1957; Ryti & 
Case 1986). When applied to traits, species can be 
plotted in trait space (i.e. using one or more traits 
instead of spatial coordinates) and processes such 
as competition can similarly lead to resource parti¬ 
tioning, which is expressed as trait over-dispersion 
(i.e. the species are more spread out in trait space, 
e.g. Ricklefs & Travis (1980), Moulton & Pimm 
(1987)). Communities that are over-dispersed in 
trait space have high trait diversity. Alternatively, if 
some traits or trait combinations are excluded from 
some communities (e.g. Grime 1973b), then such 
communities may have lower trait dispersion and 
lower trait diversity. Understanding how species 
are filtered from communities according to their 
traits remains a fundamental question in commu¬ 
nity ecology (Keddy 1992; McGill et al. 2007) and 
there is a growing number of examples of how envi¬ 
ronmental and biotic constraints vary along envi¬ 
ronmental gradients to affect trait dispersion and 
functional diversity (e.g. Weiher & Keddy 1995; 
Swenson & Enquist 2007; Pausas & Verdu 2008; 
Filippi-Codaccioni et al. 2009). 
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The second area is in the linkage between com¬ 
munity and ecosystem ecology. A principal ques¬ 
tion is the degree to which trait or functional 
diversity matters to ecosystem functioning. A pri¬ 
mary mechanism by which biodiversity is thought 
to influence ecosystem functioning is through com¬ 
plementarity, or the enhancement of ecosystem 
processes caused by the increased efficiency and 
thoroughness of resource use by organisms with a 
high degree of trait dissimilarity. Examples include 
increased processing of detritus by detritivores that 
have a range of abilities (Anderson and Sedell 1979; 
Heemsbergen et al. 2004), and increased primary 
productivity in plant mixtures with a higher degree 
of functional diversity compared to monocultures 
(e.g. Tilman et al. 1997b; Hooper & Vitousek 1997). 
These ideas have been applied to a wide range 
of taxa, ecosystem types, and ecosystem services 
(Hooper et al. 2005; Naeem et al. 2009). 

13.1.3 A brief history of trait and 
functional diversity 

We should first recognize that Theophrastus began 
the functional trait approach in western civilization, 
and Darwin and Wallace were keenly interested 
in the centrality of phenotypic variation to evolu¬ 
tion and coexistence. In the modern era. Warm¬ 
ing (1909) paid careful attention to the idea of the 
growth forms of plants and the similarities and 
differences in phenotype among coexisting species. 
As an example. Warming described how the herba¬ 
ceous plants growing under a canopy of beech trees 
were functionally quite similar to each other, even 
though they came from many different families, and 
how they were quite functionally distinct from the 
canopy trees. 

The history of trait diversity has been closely 
tied to the development of niche theory, coexis¬ 
tence and competition theory, and the evolution 
of closely related species. In the 1930s and 1940s, 
ecologists first used genus:species ratios as a proxy 
for trait diversity because congeners are gener¬ 
ally more similar to each other than they are to 
non-congeners. Elton (1946) stated that communi¬ 
ties tend to have large genus:species ratios, that is 
larger than expected trait diversity, because similar 
species, notably congeners, have a low tendency to 


coexist. This idea helped lead Hutchinson (1957) 
to the idea of a general limiting similarity based 
on body size ratios. This laid the foundation for 
the growth in competition-based niche theory in 
the 1960s and 1970s in which coexistence was seen 
as dependent on a high degree of dispersion of 
functional traits among species (e.g. MacArthur & 
Levins 1967). At that time, a key focal point was the 
regularity or evenness of body size distributions, 
which is the equivalent of recent indices of func¬ 
tional evenness (e.g. Mouillot et al. 2005; Villeger 
et al. 2008). Evolutionary biologists found support¬ 
ing evidence in the evolution of regular spacing in 
bill lengths (Lack 1947) and in the over-dispersion 
of bill sizes among coexisting Darwin's finches 
(Grant & Schluter 1984). Not all the evidence sup¬ 
ported this view, however. Williams (1964) showed 
that many communities have more species per 
genus than would be expected by chance. This sug¬ 
gested that communities have lower trait diversity 
than expected, but his findings were largely over¬ 
looked. 

By the late 1970s and 1980s, niche theory and the 
centrality of competition were questioned and crit¬ 
icized as not being adequately tested against null 
hypotheses (e.g. Connor & Simberloff 1979). This 
led to a rather heated period of debate in which 
community null models were used to test if commu¬ 
nities were indeed over-dispersed in trait space and 
therefore structured by competition (Strong et al. 
1984). By the 1990s, the methods for using ad hoc 
null models were largely solved (Gotelli & Graves 
1996) and there was growing evidence that some 
communities are over-dispersed in trait space (e.g. 
Weiher & Keddy 1995; Dayan & Simberloff 2005). 
While there was great interest in trait dispersion per 
se, the idea that trait dispersion was another form of 
diversity was also overlooked. 

In 1980, Ricklefs and Travis developed 
a functional ecomorphological approach to 
community ecology that addressed both the 
degree of similarity of coexisting species and the 
amount of trait space occupied by communities. 
The idea of trait space occupation is essentially 
equivalent to trait diversity, and their approach 
was especially important because it translated 
MacArthur's notion of resource partitioning into its 
manifestation in emergent trait-based community 



A PRIMER OF TRAIT AND FUNCTIONAL DIVERSITY 177 


functional parameters. A key area of interest was 
in how trait space is filled as communities grow in 
richness, thus forming an early linkage between 
trait and taxon richness, as well as functional 
community assembly. Ricklefs and Travis (1980), 
and later Travis and Ricklefs (1983) used ad 
hoc methods for estimating the morphological 
volume occupied by communities, but they also 
mentioned prim networks, or minimum spanning 
trees (MSTs), and later Moulton and Pimm (1987) 
used such trees as indices of morphological or 
trait volume. MSTs were used by Moulton and 
his colleagues in a series of papers in the 1980s 
and 1990s showing that introduced avifaunas 
usually have a significantly greater MST (greater 
trait diversity) than what would be expected 
under neutral community assembly. These 
results were generally interpreted as evidence for 
the role of competition in shaping community 
composition. 

Meanwhile, a somewhat separate line of research 
was building on William's (1964) results and the 
idea that coexisting species should be pheno- 
typically similar (i.e. communities should have 
lower than expected trait diversity). These models 
focused on habitat filtering and habitat-dependent 
competition causing the exclusion of species which 
lack certain requisite functional traits, leading 
to assemblage-wide convergence in traits (Grime 
1973b; Southwood 1977; Brown 1987; Keddy 1992). 

In the 1990s, Weiher and Keddy (1995) syn¬ 
thesized the competition-niche model and the 
habitat-filter model of trait dispersion into a gen¬ 
eral hypothetical model in which the amount of 
trait dispersion (trait diversity) increased with 
the importance of competition and decreased 
with the importance of environmental stress and 
with increasing scale. In addition, traits associated 
with habitat filtering can be clumped (with low trait 
diversity) while traits associated with coexistence 
can be over-dispersed within the same communities 
(Weiher et al. 1998). In plant communities, there is 
limited evidence supporting these ideas (Cavender- 
Bares et al. 2004; Swenson & Enquist 2007; Pausas & 
Verdu 2008; de Bello et al. 2009). 

Starting in the 1990s, there was a renewed inter¬ 
est in biodiversity effects on ecosystem function 
(Schulze & Mooney 1994; Kinzig et al. 2001; Loreau 
et al. 2002). Functional diversity in particular has 


emerged from the debate on whether and how 
biodiversity promotes functioning (Diaz & Cabido 

2001) . In several high-profile experiments, primary 
productivity increased with biodiversity and the 
plant number of functional groups (e.g. Tilman et al. 
1997b; Hooper & Vitousek 1997; Hector et al. 1999). 
A key mechanism by which diverse communities 
may promote functioning is through complemen¬ 
tarity, in which species assemblages are able to more 
fully utilize resources via niche partitioning, which 
is expressed through the diversity of functional 
traits (Hooper et al. (2005) and for a more detailed 
discussion regarding primary productivity and 
other mechanisms see Vile et al. (2006) and Fornara 
and Tilman (2009)). Another mechanism for biodi¬ 
versity effects is through facilitation, where individ¬ 
ual species functioning is enhanced by the presence 
of non-conspecifics (e.g. Anderson & Sedell 1979; 
Mulder et al. 2001; Cardinale et al. 2002). Facilitation 
probably occurred because non-conspecific species 
had unique traits and this would also add to the 
functional diversity of the community. 

Even though the diversity-function debate has 
been somewhat controversial, it led to an explo¬ 
sion of interest in the measurement and quantifi¬ 
cation of trait and functional diversity. A principal 
driver was the observation that simply counting 
the number of ad hoc functional types was not a 
particularly rigorous way of estimating functional 
diversity (Walker et al. 1999; Petchey & Gaston 

2002) . Many of the more recent papers have cited 
Walker et al. (1999) as the start of functional diver¬ 
sity measurement, presumably because they explic¬ 
itly linked their measure to ecosystem function. 
The recent work on functional diversity seems to 
have overlooked much of the literature on trait dis¬ 
persion in community ecology. This was fortuitous 
because rather than reinventing the wheel, genuine 
progress has been made in terms of both creativity 
and analytical rigor. Important lines of growth have 
included: 

1 . the expansion of the distance-based methods to 
include relative abundance information (Shimi- 
tani 2001; Botta-Dukat 2005; Laliberte & Legen¬ 
dre 2010) and the generalization of distance 
methods that are appropriate for ordinal and 
other problematic types of trait data (Podani & 
Schmera 2006; Laliberte & Legendre 2010) 
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2 . the development of dendrogram-based mea¬ 
sures, which were adapted from methods for 
measuring of phylogenetic diversity (Petchey & 
Gaston 2002,2006,2007; Podani & Schmera 2006, 
2007; Mouchet et al. 2008; Poos et al. 2009) 

3. the expansion of variance-based methods to 
include both trait information and species abun¬ 
dances (Mason et al. 2003) 

4. the development of new measures of trait even¬ 
ness (Mouillot 2005; Mason et al. 2005; Villeger 
et al. 2008) 

5. the application of convex hull mathematics to 
the measurement of trait volumes (Cornwell 
et al. 2006; Villeger et al. 2008) 

6 . the development of the concept and measure¬ 
ment of functional divergence, which measures 
the degree to which the most abundant species 
have extreme trait values in a way that is inde¬ 
pendent of trait volume (Mason et al. 2005; Vil¬ 
leger et al. 2008) 

7. the development of graph-theoretical measures 
that are independent of species richness (Ricotta 
& Moretti 2008) 

8 . some utilitarian testing of functional diversity 
indices as predictors of ecosystem functioning 
(e.g. Petchey & Gaston 2004; Diaz et al. 2007). 

13.2 State of the field 

13.2.1 Overview 

We might organize some of the various measures 
of trait diversity by three criteria (Table 13.1). The 
first criterion might be whether the index is based 
on species presences only or if their abundances are 
incorporated into the index. The second criterion 
might be the dimensionality of the index in terms of 
whether we are working with a single trait dimen¬ 
sion or multiple traits and hence n-dimensional trait 
space. We can also recognize that trait diversity 
might be estimated in the absence of any quantita¬ 
tive trait data by using an a priori functional clas¬ 
sification scheme or some measure of phylogenetic 
relatedness. We can organize the measures a third 
way, in parallel with the notions of diversity, rich¬ 
ness, and evenness (following Mason et al. (2005) 
and Villeger et al. (2008)). Trait diversity indices 
that correspond to species richness are based on 
species presence only and so they focus on the idea 


of the utilization or volume of trait space occupied. 
Trait diversity indices that correspond to diver¬ 
sity indices incorporate abundance information to 
weight each species' contribution to diversity. Trait 
diversity indices that parallel the notion of even¬ 
ness do so by primarily focusing on the evenness 
or regularity of the distances between the species 
in trait space, but they can also include evenness in 
abundance. A further aspect of trait diversity has 
no simple parallel with taxonomic diversity. These 
indices measure the relative clumping of species in 
trait space and were initially developed to address 
limiting similarity as it relates to species coexis¬ 
tence (Ricklefs & Travis 1980). This idea was fur¬ 
ther developed to measure the degree to which the 
most unique species are also most abundant while 
controlling for the volume of occupied trait space 
(Mason et al. 2005; Villeger et al. 2008). 

13.2.2 Indices of trait and functional diversity 

Functional diversity is usually 0 when only one 
taxon is present. If trait data are available for indi¬ 
viduals rather than taxa, this need not be the case. 
Most of the following indices require only two taxa 
to be present to calculate a non-zero index. Several 
(“MST/ FDiv, FEve) require three taxa, one ( CHV) 
requires variable numbers of taxa, depending on 
the number of traits. The following indices can be 
calculated using the FD package in R, and R source 
code is available from many of the authors' web¬ 
pages. 

Convex hull volume (CHV) (Cornwell et al. 2006; 
FRic in Villeger et al. (2008)) is the volume (or area 
in two dimensions) of trait space occupied by coex¬ 
isting species. In two trait dimensions, the convex 
hull is the perimeter of the polygon defined by the 
most extreme species, ensuring the entire surface 
of the polygon contains no concave indentations. 
It is a multidimensional version of range because 
it collapses to range in the case of a single trait. 
Individuals that fall inside the convex hull do not 
add to the total volume or area, and this makes CHV 
sensitive to only the extreme species that form the 
vertices at the perimeter of the occupied trait space. 
Convex hull volume can be expanded to any num¬ 
ber of trait dimensions, but the calculations are not 
trivial. CHV is sensitive to trait units, but Cornwell 
et al. (2006) found that three rescaling procedures 
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Table 13.1 A general classification of trait diversity indices. 



Species presences only 

Abundances incorporated 

Trait richness 



Zero trait dimensions 

Genus:species ratio 

Phylogenetic methods 
(Chapter 14) 


One trait dimension 

Range 

Variance 

(Platy)kurtosis 


N trait dimensions 

Minimum spanning tree {MST) distance 
Mean distance, MD 

Sum of distances, FAD, MFAD 
Dendrogram-based FD 

Convex hull volume, FRic 

Slope of the cumulative MST, 


Trait diversity 



One trait dimension 


Variance-based fD var 

N trait dimensions 


Quadratic entropy, FD Q 

Functional divergence, FDiv 

Functional dispersion, FDis 

Trait evenness 



One trait dimension 


Functional regularity index, FRO 

N trait dimensions 

Evenness in NND 

Functional evenness, FEve 

Trait dispersion/density of species packing 



N trait dimensions 

Mean NND 

Abundance-weighted Mean NND 


Note that any measure that can incorporate N trait dimensions can always deal with individual traits. Indices are ordered historically, starting 
with the oldest. See text for explanation of terms. 


were highly reliable (r > 0.93) when comparing 
communities with large differences in richness and 
trait diversity. A critical constraint on CHV is the 
requirement that an analysis with T traits requires 
at least T + 1 species, otherwise the volume is 0. (It 
takes two points to define a distance, three to define 
an area, four to define a three-dimensional volume, 
etc.). CHV has no simple formula. 

Dendrogram-based indices (FD) (Faith 1992; 
Petchey & Gaston 2002) were developed in order to 
solve the apparent methodological gap suggested 
by the functional diversity-ecosystem function lit¬ 
erature by using tools for measuring phylogenetic 
diversity (Chapter 14). In the original formulation 
(Petchey & Gaston 2002), a dendrogram is built 
from a trait-based distance matrix for each sample, 
and the total of the branch lengths are summed. 
The method is therefore somewhat similar to MST, 


but with additional clustering steps and a differ¬ 
ent summation procedure. The method was mod¬ 
ified (Petchey & Gaston 2006) in order to make 
the index directly analogous to phylogenetic diver¬ 
sity (PD, Chapter 14). In the modified FD, a single 
dendrogram is built for the species pool and the 
total of branch lengths is summed for the species 
present. 

Dendrogram-based measures tend to scale lin¬ 
early with richness (i.e. they do not meet the 
monotonicity criterion; Ricotta 2005) and they are 
sensitive to trait units. 

Because dendrogram-based methods require 
choosing both a distance method and a clustering 
method, there has been debate regarding methods 
and sensitivity of the results to methods (Petchey 
& Gaston 2006, 2007; Podani & Schmera 2006, 2007; 
Mouchet et al. 2008; Poos et al. 2009). If the data 
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have a wide range of richness and trait diversity, 
then the various methods tend to be quite reliable 
(Petchey & Gaston 2006). However, if the trait data 
are not continuous, or if one is comparing commu¬ 
nities with equal numbers of species, then the meth¬ 
ods are less reliable (Podani & Schmera 2006, 2007; 
Poos et al. 2009) and greater care should be taken. 
Optimal clustering methods cannot be defined a 
priori, therefore a best practice may be to use a 
variety of combinations of distance and cluster¬ 
ing methods to create consensus dendrograms, and 
the original and consensus dendrograms should be 
compared if a single, best method is sought (see 
Mouchet et al. (2008) for details). Alternatively, one 
may view the various resultant FD indices as multi¬ 
ple indicators of a latent variable. There is no direct 
mathematical formula. 

Functional dispersion (FDis)(Anderson et al. 
2006; Laliberte & Legendre 2010) measures the 
mean distance to the centroid of the assemblage 
in trait space, weighted by relative abundances. 
As such, it is conceptually similar to FDQ,FD var , 
and FDiv even though the mechanics of measure¬ 
ment are different. FDis was strongly correlated 
with FDq ( r = 0.96) and it had a triangular limit 
function relationship with FDiv (i.e. the upper limit 
of FDis increased with FDiv ) using random data 
(Laliberte & Legendre 2010). It is worth noting that 
formal significance tests of FDis values can be deter¬ 
mined via homogeneity of multivariate dispersion, 
if abundance information is not used (Anderson 
et al. 2006; see Laliberte and Shipley (2009) for 
details). 

Calculations involve the following steps. The 
weighted centroid is defined as a vector, c, of the 
weighted mean trait values using the abundance, a, 
for each of i species and k traits: 

s 

E i-ik 
C = [Ck] = — s - 

E*. 

i=l 

S 

FDis = i±— 

X> 

i=l 

Then FDis is the weighted mean distance, z, from 
each species to the weighted mean centroid. 


Functional divergence (FDiv) (Villeger et al. 
2008) was developed to expand the notion of 
variance-based FD var to multiple dimensions and to 
standardize the index so that it is independent of 
both the volume of trait space occupied ( CHV ) and 
functional evenness (FEve). It is the degree to which 
the most abundant taxa are far from the centroid 
of the assemblage in trait space, standardized to 
the average distance to the unweighted centroid of 
the species at the perimeter of the convex hull (c.f. 
the weighted centroid of all taxa, FDis). FDiv is high 
when the most abundant species have extreme trait 
values (i.e. are vertices on the convex hull; sit on 
the perimeter of the occupied trait space). Although 
this is common to other measures of trait diversity 
(e.g. FDis, FD mr/ and FDq), FDiv is standardized so 
that it is independent of trait volume and therefore 
it uniquely quantifies this one particular aspect of 
trait diversity. Three taxa are required. 

The calculations require the following steps. First, 
the centroid (or middle of gravity) of the assem¬ 
blage is defined using only the V taxa on the 
perimeter of the convex hull. The centroid G is 
defined as a vector (G = gi, g 2 , ..., gk) of the mean 
of each of k traits 

.if, 

8 k ~ y 2—i ti k 

i =1 

where t is the value of trait k in species i, and V 
is the number of species on the perimeter of the 
convex hull (i.e. the vertices). If the number of traits 
is greater than the number of taxa plus one (if T > 
S + 1), then all the taxa are vertices and S replaces V. 

For each species, the distance to the centroid 
is calculated as the Euclidean distance (see also 
Box 13.1), and then its mean is found: 

r 

dG,= 

N *=i 

1 s 

dG = -J2<tG, 

^ i=i 

From these, two parameters are calculated. The 
abundance-weighted difference between each 
taxon's distance to the centroid and the mean 
distance to the centroid is 
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s 

Ad = Pi (dGi - dG) 

i=l 

Note that Ad is large if abundant taxa have large 
distances to the centroid, relative to the mean of the 
assemblage. The total of the weighted differences is 
summed as 

s 

A\d\ = Pi |^G, - dG | 

;-i 

Finally, 

. Ad + dG 

FDiv = -= 

A\d\+dG 

Therefore, FDiv is the abundance-weighted dis¬ 
tance by which the most common species tend to 
be at tire periphery of the assemblage in trait space. 
The maximum possible value for Ad is dG, so stan¬ 
dardizing Ad to its maximum value forces FDiv 
to have a maximum of 1. If all taxa are equidis¬ 
tant from the centroid (and therefore all sit on the 
perimeter and serve as vertices in the convex hull) 
and have equal abundance, then FDiv is 1. This can 
occur only if the taxa are arranged at equal dis¬ 
tances to the centroid, but regardless of their prox¬ 
imity to each other. FDiv is independent of species 
richness and trait units, but note that trait data 
should still be rescaled. When there are fewer than 
three taxa, the index is of questionable meaning or 
use. FDiv will increase as more species are on the 
perimeter and are vertices, therefore if the num¬ 
ber of traits is high relative to the number of taxa, 
FDiv will tend to increase with richness because all 
the taxa will be vertices of the convex hull until 
S + 1 = T. 

Functional evenness ( FEve ) (Villeger et al. 2008) 
combines both the evenness of species spacing 
in trait space and the evenness of species rela¬ 
tive abundances. It measures the consistency in 
the branch lengths of the MST after weighting the 
branch lengths by the sum of the relative abun¬ 
dances of the two species at the ends of each 
branch. The index is 1 if all species have equal 
abundance and if all the branch lengths have equal 
length, and it declines toward zero with increas¬ 
ing unevenness in either aspect. This index is dif¬ 
ferent from measuring the standard deviation of 
the nearest-neighbour distances because it uses the 


MST, not just nearest neighbours. By doing so, it 
truly measures the regularity of spacing in trait 
space, rather than the regularity of species pack¬ 
ing. FEve can be used without abundance infor¬ 
mation, if one wishes to estimate trait evenness 
separately from evenness in abundance (the sim¬ 
plified methods are provided below). It is inde¬ 
pendent of species richness and measures of trait 
diversity such as CHV or MST. FEve is not defined 
for assemblages with fewer than three taxa because 
three points are required to assess the regularity 
of spacing patterns, just as it takes three points 
to define a plane. Alternatively, one might use the 
index and conclude that assemblages with fewer 
than three taxa have high evenness (because there 
is no variation in the distances between taxa in trait 
space). Great care should be taken in the interpre¬ 
tation of such values because they at least partly 
nonsensical. 

FEve is calculated from an MST where is the 
branch length from species i to j, S is the number 
of species, and p is the relative abundance: 


FEve = 


s-i 

E min 
1=1 



EW, = 

Pi + Pi 


i 

s-i 

-where 


In the absence of abundance information, the equa¬ 
tion simplifies to: 


E m i n ( mst ’ s-i ) s-i 


The standardization with the inverse of richness 
minus 1 is needed for two reasons. First, in the case 
of perfectly even spacing and abundance, the values 
within the minimization will always be 1/(S — 1), 
and this forces the index to have a maximum of 1. 
Second, in cases where there is not perfect evenness, 
it accounts for bias caused by the fact that there will 
always be one case where the value in the mini¬ 
mization is less than or equal to 1/(S - 1) (Villeger 
et al. (2008) cite Bulla (1994) for more details). In 
practice, the standardization may tend to increase 
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the amount by which FEve decreases from 1 as the 
assemblage becomes less even. 

Mean distance (MD) (Weiher et al. 1998; Heems- 
bergen et al. 2004) between coexisting taxa mea¬ 
sures the average differences among taxa and is a 
simplistic estimator of trait volume. Mean distance 
is analogous to the mean phylogenetic distance 
(MPD; Chapter 14). It is sensitive to distance mea¬ 
sure, trait units, and the number of traits. Mean dis¬ 
tance can decline when intermediate taxa are added 
to communities and it can similarly increase when 
intermediate taxa are extirpated or go extinct. This 
means MD does not meet the monotonicity criterion 
(Ricotta 2005) in that adding species with new com¬ 
binations of functional traits should increase trait 
diversity, or at least leave it unchanged. This ten¬ 
dency is most pronounced when comparing assem¬ 
blages in which all taxa are at the extremes of 
the trait ranges to assemblages with taxa near the 
middle: 


MD = 


2! (S — 2)! 
S! 


E E d n 


<=1 7=/+l 


where S is richness and d, j is the distance between 
species i and j. 

Minimum spanning tree ( MST ) (Ricklefs & 
Travis 1980; Moulton & Pimm 1987) is the length 
of the minimum non-recursive (i.e. no loops) tree 
connecting all species in trait space (i.e. the short¬ 
est 'connect-the-dots' distance). MST is a multidi¬ 
mensional version of range because it reduces to 
the range for a single trait. Species that are directly 
intermediate do not add to the total length, but in 
practice MST tends to increase monotonically with 
richness, even with random communities (see also 
Ricotta and Moretti (2008) and omst)- MST is con¬ 
ceptually similar to the dendrogram-based methods 
(FD), but it is simpler because it does not require 
grouping points and recalculating distances. MST 
is sensitive to distance measure, trait units, and the 
number of traits. 

Quadratic entropy (FDq) (Rao 1982; Shimitani 
2001; Botta-Dukat 2005) is the sum of the distances 
between pairs of species in trait space, weighted 
by the product of their relative abundances. 
The weighting factor is also the joint probability 
of choosing an individual of each species. FDq 
is conceptually the abundance-weighted mean 


distance. Weighting each distance by abundance 
means that FDq will increase as the most abun¬ 
dant species have increasingly extreme traits. FDq 
is analogous to the weighted mean phylogenetic 
distance (MPD; Chapter 14). FDq is independent of 
the number of traits, but it is sensitive to the scale 
of the traits. It can decline when intermediate taxa 
are added and increase when intermediate taxa are 
removed, especially when taxa otherwise load at 
trait extremes (Pavoine et al. 2005a): 

s-i s 

FD o = E E d <iPiPj 

1=1 /=/+1 

where d t j is the distance between species i and j, 
and p is the relative abundance. 

Botta-Dukat (2005) suggested using the mean dis¬ 
tance (either Euclidean or Manhattan) for each trait 
divided by the number of traits: 

J 

di-i = j E/^r 

k =1 

1^1 , 

“>■/ - J 2_, 4* ~ 4* I 

k=l 

In this formulation, the distances are always Man¬ 
hattan. Alternatively, it is possible to use the 
Euclidean distance divided by the maximum pos¬ 
sible distance, VT: 


4 / = 

1 VT, 


E ( tik - *m ) 2 


In practice, this standardization to ensure that FDq 
is bounded to a maximum of 1 is probably unneces¬ 
sary; one might just as easily use the Euclidean dis¬ 
tance for calculating FDq. Using the data described 
below, the two methods were strongly correlated 
(r = 0.97). However, the use of Euclidean distance 
with FDq means that it will reach a maximum 
when taxa are evenly abundant and each sits at 
the periphery of the trait space of the species pool 
(Pavoine et al. (2005a) and cf. FDiv and FEve). More¬ 
over, the behaviour of FDq depends on the choice 
of distance measure (Shimitani 2001). Pavoine et al. 
(2005a) suggest using ultrametric distance matrices 
(i.e. distance metrics that come from a tree, such 
as an MST) because this maximizes FDq when all 
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taxa are included. This approach would avoid the 
potential problem of recommending that some taxa 
should be extirpated in order to increase functional 
diversity. 

If all the taxa are completely different (i.e. all 
dij = 1), then FDq reduces to the Simpson diver¬ 
sity index. If all the taxa have equal abundance 
(of if there is no abundance information), then 
FDq reduces to FAD/S 2 . If abundance informa¬ 
tion is unavailable, it may be better to simply 
use MD. 

Nearest-neighbour metrics:density of species 
packing (meanNND) (Ricklefs & Travis 1980) reg¬ 
ularity of species packing ( sdNND ) (Ricklefs & 
Travis 1980), and evenness of species packing 
(evenNND) have been used to assess whether there 
are consistent limits to similarity among coexist¬ 
ing species. MeanNND has been used to assess the 
degree of trait dispersion in terms of the aver¬ 
age limit to similarity within assemblages. Com¬ 
munities with high trait dispersion also have high 
trait diversity. Mean NND can also be calculated 
as the abundance-weighted mean (see also Chap¬ 
ter 14). The regularity of species packing has been 
measured as the standard deviation of the nearest- 
neighbour distances between coexisting taxa in trait 
space. Small sdNND indicates regularity or even¬ 
ness of species spacing in trait space. This approach 
was borrowed directly from the study of spatial pat¬ 
terning. These methods differ from FEve in that only 
the nearest-neighbour distances (NND) are consid¬ 
ered. Therefore, sdNND measures the evenness in 
species packing as opposed to the evenness of trait 
space utilization. Although this may seem to be a 
subtle difference, the use of NND means that any 
long gaps in the distribution of species in trait space 
are ignored. 

If we apply the numerical approach of func¬ 
tional evenness (FEve) to the evenness in nearest- 
neighbour distances, then the index will have sim¬ 
ilar properties (i.e. independence with the amount 
of occupied trait space and species richness), which 
should be an improvement over simply measuring 
the standard deviation of NND: 

s 

£ min 

;=i 

evenNND = - 


Here, we standardize by the inverse of the number 
of species because each species has a nearest neigh¬ 
bour. 

Slope of the cumulative MST curve (a MS r) 
(Ricotta & Moretti 2008) is the degree to which 
the MST grows in length as taxa are added, after 
ranking the branch lengths from smallest to largest. 
The total lengths of tree-based indices, such as 
MST and FDpc, have a strong tendency to increase 
with species richness. The rate of increase as a 
function of species rank controls for variation in 
richness because it measures only the degree of 
increase in MST branch length with the rank of the 
branch length. This index was specifically designed 
to be independent of species richness. Low val¬ 
ues of this index indicate a high degree of func¬ 
tional redundancy and therefore low functional 
diversity. 

To calculate a M sr/ rank the branches from the 
MST from smallest to largest and then calculate the 
cumulative branch length at each rank. Then cimst 
can be calculated as the slope of the relationship 
between the cumulative branch length and rank on 
a log scale: 

cbl = c (Rankkrflrtc/i./engf/i) 

where cbl is the cumulative branch length from the 
MST after first ranking the branches from smallest 
to largest. 

Sum of distances or functional attribute diver¬ 
sity (FAD) (Walker et al. 1999) and number of cat¬ 
egorical trait states (T) (Mayfield et al. 2005) are 
the sum of the distances between coexisting taxa in 
trait space, while the modified functional attribute 
diversity ( MFAD ) (Schmera et al. 2009) is FAD stan¬ 
dardized to the number of functionally unique taxa. 
FAD is analogous to the sum of the phylogenetic 
distances (SPD; Chapter 14). FAD is generally not 
recommended because it does meet the twinning 
criterion, in that functional diversity should not 
increase if an identical taxon is added or if a taxon is 
split into two identical taxa (Weitzman 1992; Mason 
et al. 2003,2005). This makes FAD very tightly liked 
to species richness. In order to address this, the 
modified FAD (MFAD) was developed (Schmera 
et al. 2009). The approach first lumps taxa with 
identical trait values (across all traits) into a sin¬ 
gle functional unit while each unique taxon is con¬ 
sidered to be a functional unit. Then the sum of 
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distances between functional units is standardized 
to the number of functional units (F). This makes 
MFAD similar to the mean distance, but improves 
on it by removing the cost of functionally equiva¬ 
lent taxa (with zero distance) by lumping them. It is 
also specifically designed to handle nominal traits: 

FAD= S f± dij 

«= 1 7 =i+l 

where djj is the distance between species i and j. 

A4FAD = jYE d '-> 

1=1 ;=/+1 

where d, j is the distance between the unique func¬ 
tional units. If taxa are assigned to functional 
groups, then FAD is the number of groups. 

Schmera et al. (2009) suggested using the 
Marczewski-Steinhaus distance: 

21 |f/.fc — tj.k\ 

di., - r 

2 ] max tjj t) 

k =1 

or the Canberra distance. They found MFAD per¬ 
formed very well with both artificial and observed 
data in which the traits were binomial, ordinal, or 
fuzzy. Their taxa had a significant amount of func¬ 
tional redundancy and so it is not clear how MFAD 
performs with continuous traits where there is little 
or no redundancy. 

Mayfield et al. (2005) used a similar approach to 
FAD when dealing with sets of categorical traits. 
They used the number of categorical trait states 
present in a community for six traits, each of which 
had as many as 13 states. Their approach was basi¬ 
cally univariate, in that they simply summed the 
number of trait states (i.e. FAD) across the six traits, 
rather than use a multivariate distance measure 
which would have captured the number of unique 
combinations of trait states. They found that the 
number of trait states was not strongly correlated 
with species richness, nor was there evidence for 
a rapid saturation of trait states with increasing 
richness. 

Variance-based functional diversity (FD var ) 
(Mason et al. 2003, 2005; Leps et al. 2006; 
de Bello et al. 2009) was developed as the 
abundance-weighted sum of squares of the 


observed trait values and therefore it is analogous 
to the calculation of a weighted variance. 
This method must be applied to each trait 
singly, and then it can be summed or averaged 
across traits. Because it employs logarithms, 
the trait values must be greater than 0. Like 
several other indices, FD var will increase as 
the abundance of intermediate species declines 
to 0. 

If there are S species, with a relative abundance, 
p, and a single trait of values t: 


FD 


var 


2 


■n arctan 


5X>/ (Inf, 


L i=i 



s 

taw. in ~ 2 1 Pi f 
i =1 


where t aw . ln is the abundance-weighted logarithmic 
mean of the trait. 

The dressing around the summation helpfully 
puts the index into the range of about 0-1 (Mason 
et al. 2003). Alternatively, the abundance-weighted 
variance may be calculated without standardization 
(Leps et al. 2006; de Bello et al. 2009): 

s 

LDvar.2 = Y Pi { l i ~ tawf 
1=1 

s 

taw — 2 1 P‘ h 
1=1 


13.2.3 Partitioning the components of trait 
diversity 

Inter- and intraspecific components of trait diversity 
This is a method that is used to calculate both the 
among-taxa variance in trait values and the within- 
taxa variance in trait values, following from the 
calculations for analysis of variance. This approach 
explicitly includes intraspecific variation in trait 
values: 


s lnong = Y ^ (f- - rf 
/'= 1 

s 

f = 2 >fi 
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where f, is the mean trait value for species i and p 
is its relative abundance. Then, 


1=1 

where s 2 is the within-taxa variance in trait val¬ 
ues (Leps et al. 2006). Then the sum of among-taxa 
variance and within-taxa variance can be used as a 
measure of trait diversity, and the variance compo¬ 
nents add another level of detail. 

Within- (oc) and among- (|3) community components 
of trait diversity 

Following from Whittaker's concepts of a, |3, and 
y diversity (see also Chapter 6), one may similarly 
assess both the within-community a trait diversity 
and the extent of turnover in traits across com¬ 
munities (P). The partitioning may be done addi- 
tively (where y = oc + P) or multiplieatively (where 
Y = « x p). y is determined as the trait diversity of 
the collection of species and their overall relative 
abundances. Comparing the relative contributions 
of a and P in terms of species and traits can be par¬ 
ticularly enlightening. For example, de Bello et al. 
(2009) found that a trait diversity accounted for a 
much larger proportion of y than species richness- 
based diversity, which suggests a high degree of 
functional redundancy in the species pool. 

13.2.4 Methodological issues 

Traits: selection and measurement 
While this issue cannot be reviewed in detail here, 
the general aim is to capture the functional traits of 
species or taxa that have particular meaning for the 
questions of interest, whether they are about con¬ 
straints on community assembly, effects on ecosys¬ 
tem processes, or something else. One approach 
includes first identifying the principal challenges or 
limitations on species performance, and then identi¬ 
fying both the ideal attributes and any simple, mea¬ 
surable phenotypic correlate of the ideal attribute. 
For example, for plants, key challenges include dis¬ 
persal, establishment, and persistence, and it was 
recognized that seed mass, leaf economic traits, and 
height may serve as a minimal core set of traits, 
while additional key traits have been identified 
(Westoby 1999; Weiher et al. 1999; Comelissen et al. 


2003). If the questions relate to ecosystem function, 
then a subset of these might be of primary interest. 
A second approach has been to focus on traits asso¬ 
ciated with life history and resource acquisition in 
the broadest sense, including the ecomorphology of 
mobility, feeding structures, and resource use (e.g. 
Ben-Moshe et al. 2001; Lamouroux et al. 2004; Poff 
et al. 2006; Gurd 2007). For animals, example traits 
include body size, mouth size and shape, special¬ 
ized feeding structures (e.g. dentition in mammals, 
beaks/bills in birds, mandible parts in arthropods), 
the ecomorphology of locomotion (e.g. body shape, 
wing size, shape), phenology (e.g. crepuscular, or 
nocturnal, dormancy or migratory), and life history 
(e.g. parity, fecundity, egg/offspring size). Some 
authors have used prey use/trophic level as traits, 
when they are actually measures of functional role 
and niche. Similarly, habitat affinity has been used 
even though this is not a trait, but a measure of 
realized niche. 

When using continuous traits, it is common to 
use species mean values, even though it is widely 
recognized that continuous functional traits can 
have considerable within-species variability. The 
justification for using species means is based on 
the observation that species rankings by functional 
traits tend to be consistent when sampling across 
time and space (e.g. Gamier et al. 2001) and because 
the differences between species tend to be larger 
than the differences within species. For example, 
Kraft and Ackerly (2009) reported that specific 
leaf area varied 1.7-fold within species and 9-fold 
among species. Several approaches have been sug¬ 
gested to deal with intraspecific trait variation. One 
approach is to measure the continuous traits in each 
sample or in each individual (e.g. Cianciaruso et al. 
2009). A second approach is to use the variability 
in trait values to estimate the degree of overlap, 
and then use overlap as the complement of distance 
(see also section 1.4.3) (e.g. Mason et al. 2003; Leps 
et al. 2006). A third approach is to use intraspe¬ 
cific variability as an indicator of niche breadth 
(e.g. Violle & Jiang 2009), in other words as a trait. 
Intraspecific trait variation is conceptually prob¬ 
lematic because it arises through a combination of 
both meaningful factors (e.g. genotypic variation, 
phenotypic plasticity, acclimation, and ontogeny) 
as well as by measurement error and background 
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noise. Care should be taken when estimating trait 
variance (because trait data are rarely collected ran¬ 
domly or evenly across taxa, some kind of resam¬ 
pling method should be used to estimate variances, 
e.g. Rust and Rao (1996)). 

Trait scaling and independence 

When calculating trait diversity, it is usually impor¬ 
tant to standardize the trait data because traits are 
measured with different units and this means they 
have a variety of ranges and variances. There is 
currently no simple consensus on the best way to 
do this. One solution is to z transform quantitative 
trait data (subtract the mean and divide by the stan¬ 
dard deviation). This is appropriate when compar¬ 
ing trait diversity among samples within a study 
because it gives each trait the same weight by elim¬ 
inating any effect of measurement units. However, 
when traits are standardized by their standard devi¬ 
ation, it has the effect of putting them in standard 
deviation units, and so it is not possible to com¬ 
pare trait diversity across studies (unless the traits 
have equal variances, which is, of course, extremely 
unlikely). Furthermore, if some of the traits are not 
continuous variables, then scaling by the standard 
deviation makes no sense (Podani & Schmera 2006). 

An alternative might be to log transform contin¬ 
uous traits. This would equalize the units of the 
traits, where one unit would represent a 10-fold 
change. It would not change any differences in trait 
range or variance, so the traits would not contribute 
equally, but it would help in cases where one trait 
ranges over several orders of magnitude and others 
do not. 

A solution is to standardize the variables to their 
relevant range by subtracting the minimum then 
dividing by their range, so that the variables are 
expressed as a proportion of their entire observed 
range. This is the standardization used in Gower 
distances (e.g. Podani & Schmera 2006) and it has 
been suggested several other times (Botta-Dukat 
2005; Mason et al. 2005; Grace 2006). The relevant 
range may be defined by the. observed data set 
(i.e. the observed minimum and maximum), some 
regionally appropriate data range or, perhaps ide¬ 
ally, the global range. However, global ranges often 
suffer from extreme trait values that may have lit¬ 
tle meaning for a particular study (e.g. mammalian 


body size has a maximum of over 30 m and leaf 
mass per area has a maximum over 1400 g m -2 ). For 
some traits it might be quite reasonable to know the 
maximum and minimum global trait (e.g. height or 
body length), but for lesser-known traits it would 
helpful to establish clear physical limits (e.g. leaf 
dry matter content by definition has to be greater 
than 0 and less than 100, but we don't know its 
absolute physical limits). 

This will not, however, eliminate redundant traits 
that are not independent of each other and essen¬ 
tially measure a similar function or attribute. For 
example, many measures of functional morphology 
and ecophysiology are intrinsically linked to body 
size (Brown & West 2000) and smaller details, even 
functional traits of a single organ, such as specific 
leaf area, leaf dry matter content, leaf thickness, 
and leaf size, tend to be at least weakly correlated 
(Wright et al. 2004). If correlated measures of func¬ 
tion are used, then those functions or trait axes are 
over-weighted relative to other functions or trait 
axes. One solution would be to down-weight cor¬ 
related or redundant traits (Pavoine et al. 2009) so 
their total length or input weight would be 1 or 
equal to other functionally-linked traits. 

A more general solution is to collapse the trait 
axes by using principal components analyses (PCA) 
or principal coordinates analysis (PCoA). This leads 
to an equivalent number of uncorrelated trait axes, 
but if PCA is based on correlation, then the com¬ 
ponents will be in meaningless standard deviation 
units. If the traits are first standardized to their rel¬ 
evant range and then collapsed into principal com¬ 
ponents using covariances rather than correlation, 
then the resultant components should be uncorre¬ 
lated and unique, but they will retain their units 
(percentage of trait range). Alternatively, (PCoA) 
can be based on any distance measure. 

Even though we should be careful about using 
redundant or correlated traits, we should not nec¬ 
essarily avoid them altogether. Leps et al. (2006) 
found that correlated traits do not necessarily pro¬ 
duce particularly correlated trait diversities. They 
also explained that traits can be correlated due to 
physical constraints, but they may relate to dif¬ 
ferent, or at least distinctive, ecological processes. 
For example, offspring size and fecundity tend to 
be negatively related because of a fundamental 
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trade-off, but they relate to different ecological 
processes (e.g. survivorship and dispersal vs repro¬ 
ductive output). It may therefore be important to 
include such correlated traits in order to capture 
particular ecological processes. 

Distance/dissimilarity methods 
Euclidean distances are commonly used to measure 
distances between taxa in trait space. This is accept¬ 
able as long as the trait data are quantitative and 
continuous. However, when trait data are ordinal, 
binomial/nominal (e.g. wings or no wings), fuzzy 
(e.g. animals that are opportunistically carnivorous, 
but are often not so), or circular (e.g. phenology by 
month of year), then Euclidean distances become 
less meaningful. In addition, such data should not 
be z transformed. A solution is to use Gower's dis¬ 
tance (GD; Podani 2005; Pavoine et al. 2009). 

Gower's distance is the average Manhattan met¬ 
ric across traits, after rescaling them to their rele¬ 
vant range. The relevant range (Rk) can be defined 
as either the observed range of trait values or the 
regional or globally observed range. 

Zi.k — hk / Rk 
1 T 

GDi j = — y ' Zj k — Zj k | 
k=l 

Gower's distance can be further generalized to 
adding weighting to each trait. The function gowdis 
is available in the FD package in R. 

Matching dissimilarity (MD) is a distance 
approach that estimates niche overlap, and 
is estimated by the degree of overlap in the 
distribution of trait values between two taxa 
(Mason et al. 2003; Leps et al. 2006): 



For qualitative or binomial traits with no intraspe¬ 
cific variation, MD is the same as Gower's dis¬ 
tance. For quantitative traits, is can be estimated by 
calculating the overlap in the probability density 
functions of each trait for each pair of coexisting 
taxa. Leps et al. (2006) suggest using the trait mean 
and standard deviation with the approximation of 
the normal distribution to calculate overlap. Any 
species pair that has zero overlap will have MD = 1 


and so there may be considerable loss of distance 
information when using this metric. One might con¬ 
sider using the matching dissimilarity to down¬ 
weight other distance measures by the amount of 
overlap, for example: 

ED° = E Di j x MDi.j 

Rarefaction 

When organisms are sampled incompletely, with 
variation in sample size or with variation in the spa¬ 
tial arrangement of samples, biodiversity indices 
are biased and are not readily comparable (see 
Chapter 2). A solution is to use rarefaction, where 
unbiased biodiversity indices are either calculated 
or bootstrapped (see Chapter 4). Rarefaction is usu¬ 
ally not necessary with true census data, such as a 
diversity of sessile organisms in small areas. How¬ 
ever, at large spatial grain size, when combining 
samples to estimate the diversity of large spatial 
grain sizes, when combining samples with different 
spatial arrangement of data points, or when sam¬ 
pling motile animals, rarefaction methods are usu¬ 
ally necessary. 

Walker et al. (2008) provided suggestions for rar¬ 
efying trait diversity. They described four algo¬ 
rithms for generating species lists for a random 
sample of n individuals or m sample units. Gotelli 
and Colwell (Chapter 4) recommend using sample- 
based algorithms, with adjustment for the num¬ 
ber of individuals sampled. Walker et al.'s study 
showed how rarefaction was helpful in determining 
the unbiased change in functional diversity over 
time and how it also helped to show that extirpated 
species were functionally redundant. 

Abundances are not equivalent 

Any index that incorporates abundance informa¬ 
tion is sensitive to the way in which abundances are 
measured. Evenness measures are particularly sen¬ 
sitive (see Chapter 5). One cannot compare diver¬ 
sity indices that were calculated using different 
abundance units. There is a general tendency for 
biomass-based abundances to have a much greater 
difference between the maximum abundance and 
the minimum abundance, with cover estimates 
intermediate and frequency (i.e. counts of individu¬ 
als) having the least spread (e.g. Leps et al. 2006). 
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This means biomass-based indices will be biased 
to lower diversity and evenness than the other 
measures. 

Effective numbers transforms of trait diversity 
If half the trait diversity is lost, the index should 
also be reduced by half. The indices of trait rich¬ 
ness ( CHV, FD, MST) will generally conform to this. 
However, when abundance information is used, it 
is often transformed or recombined in a way that 
distorts the direct interpretation of the index (Jost 
2006). A solution is to translate the index back to 
the 'numbers equivalent' (Jost 2006). For FDq, this 
has been solved because FDq reduces to the Simp¬ 
son diversity if the distance between every species 
is 1, but more work is needed to assess the other 
indices. 

13.2.5 Conceptual issues 

Is functional diversity a unitary latent variable? 

One of the principal questions involving functional 
diversity is whether there is one best measure and 
there has been considerable debate over which mea¬ 
sures and methods are most ideal. In the recent lit¬ 
erature, there has been an emphasis on measuring 
functional diversity in order to capture its potential 
effects on ecosystem functioning, while much of the 
earlier interest was in capturing the potential effects 
of environmental filtering and biotic interactions on 
community composition (i.e. assembly rules). We 
should make every effort to be transparent about 
our goals and be open to the notion that the dif¬ 
ferent measures of trait diversity may be more or 
less insightful depending on the questions at hand. 
Even so, we should consider whether the various 
indices are essentially measuring the same concep¬ 
tual construct. 

Trait diversity is a conceptual construct that 
we cannot measure directly (cf. temperature), but 
instead we must use one or more indicators of 
the concept in order to indirectly measure it. This 
brings up the question of reliability and whether it 
is in fact a singular latent variable, or if the vari¬ 
ous indices are indeed measuring distinct aspects of 
diversity. To test this, I used principal components 
analysis to assess the correlation structure among 
the indices. I calculated the indices using data from 


an experimental grassland (524 1 m 2 quadrats with 
richness from 3 to 15 species, 66 plant species, and 
two independent functional traits, height and leaf 
dry matter content rescaled from 0 to 1 (to avoid 
zero CHV values) and artificial data (equiprobable 
random richness from 3 to 20 species, with random 
species abundances, and two random traits)). 

There were strong correlations between a core 
set of trait diversity indices (CHV, FAD, FD lf FD 2 , 
FDis, FD var i, FD var 2 , FDq, MFAD, and MST) in 
both observed plant assemblages and artificial data. 
These were manifested as nearly equivalent load¬ 
ings of these indices on the first principal compo¬ 
nent of both analyses, and each explained over 50% 
of the variation in the 15 indices (Table 13.2). The 
mean Pearson correlation between the indices load¬ 
ing heavily on the first principal component was 
0.817, suggesting good reliability in their measure¬ 
ment of trait diversity in the observed plant assem¬ 
blages, but it was less reliable in the artificial data 
(f = 0.719). 

The PCA suggested that there are about 
four unique aspects of trait diversity (Fig. 13.1, 
Table 13.2). The core set that loaded heavily on 
the first component was separated into those 
incorporating abundance data and those that did 
not on the third component in grasslands and 
on the second component in artificial data. This 
suggests that in real communities the distinction 
between trait richness and trait diversity may be 
unnecessary. A second aspect can be interpreted 
as the general spacing patterns, as measured 
by nearest-neighbour metrics, mean distance, 
and the slope of the cumulative MST curve. The 
strong correlation between a MST and MD was 
unexpected (r = 0.80 and 0.73 for observed and 
artificial data, respectively). In grasslands these 
indices were separated from the others on the 
second component, while they were separated on 
the fourth component in random communities. A 
third aspect was functional evenness (FEve and 
evenNND). They were more strongly correlated 
with random data (r = 0.504) than grassland data 
(r = 0.318). This may have been caused by the 
random assignment of abundances, which would 
tend to increase species evenness in comparison 
to real data. Functional divergence (FDiv) was a 
distinct aspect of trait diversity. 
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Figure 13.1 Principal components analysis of 16 indices of trait and functional diversity. The upper ordinations are based on 524 observations from an 
experimental grassland plant community and two traits rescaled to range from 0 to 1, the lower is based on 400 simulated communities with random 
assembly, random abundance, and two random traits. Both used correlation. 


Is trait diversity independent of species richness and 
should we use a null model to correct for this? 

Nearly all of the measures are mathematically inde¬ 
pendent of richness in that adding a redundant 
taxon, or splitting a species into two identical taxa, 
does not cause an increase in trait diversity. Even so, 
species richness and trait diversity tend to be posi¬ 
tively correlated with both simulated and real data 
(Table 13.3). The trait diversity indices that loaded 
heavily on PC 1 were also strongly correlated with 
species richness. There was reasonable concordance 
between the two data sets. Of the remaining indices, 
only meanNND was negatively correlated with rich¬ 
ness, and this was presumably due to the simple 
requirement that the density of species packing will 
tend to decline with increasing richness. 


A principal cause of the correlations with species 
richness is the sampling effect: with increasing sam¬ 
ple size, there is a greater likelihood of collecting 
extreme values (e.g. Huston 1997; Leps et al. 2006). 
Variance-based indices (FD var ) have a tendency to 
increase with richness for same reason that mea¬ 
sures of variance tend to increase with sample size. 
Distance-based indices (e.g. FD, MST, MD, FDq) 
behave similarly. Convex hull volume also tends 
to increase due to the sampling effect, and because 
maximal hull volume occurs only when richness is 
equal to or greater than 2 T where T is the number 
of traits and when each taxon is on the perimeter 
(Villeger et al. 2008). 

Ricklefs and Travis (1980) interpreted the rela¬ 
tionship between trait richness and species richness 
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Table 13.2 Principal components analysis of 13 trait diversity indices 
using both observed plant assemblages and artificial data. PC, principal 
component. 



PC 1 

PC 2 

PC 3 

PC 4 

Plant assemblages 





Standard deviation 

3.060 

1.661 

1.027 

0.960 

Proportion of variance 

0.585 

0.172 

0.066 

0.058 

Cumulative proportion 

0.585 

0.757 

0.824 

0.881 

Loadings: 





a MST 

-0.138 

0.517 


-0.145 

CHV 

-0 304 

0.112 

-0.116 

-0.160 

FAD 

-0.293 


-0.305 


MFAD 

-0.314 


-0.246 


FDis 

-0.285 


0.176 

0.176 

FDiv 


-0.319 

0.428 

-0.629 

fOpGt 

-0.312 


-0.183 


FDp G2 

-0 284 

-0.147 

-0268 


FDq 

-0.293 


0.323 

0.200 

FDv arl 

-0.290 


0.262 


fD va , 2 

-0.292 


0.349 

0.163 

FEve 

-0.179 

-0.151 


0.541 

MD 

-0.158 

0.427 


-0.195 

MST 

—0.311 


-0.146 

-0.135 

meanNND 

0.144 

0.457 

0.269 


evenNND 

-0.105 

-0.395 


0.190 

Artificial data 





Standard deviation 

2.915 

1.735 

1.233 

1.063 

Proportion of variance 

0.531 

0.188 

0.095 

0.071 

Cumulative proportion 

0.531 

0.719 

0.814 

0.885 

Loadings: 





O-MST 

-0.228 

0.250 


-0461 

CHV 

-0324 




FAD 

-0.284 

-0.264 



MFAD 

-0 311 

-0.226 



FDis 

-0.285 

0.318 


0.233 

FDiv 


0.252 

0.228 

0.615 

fOpGI 

-0.275 

-0.153 


-0.109 

®PG2 

-0.267 

-0 303 



FDq 

-0.303 

0.204 


0.165 

FD\i arl 

-0 275 

0.175 



PDv ar2 

-0.268 

0.318 


0.150 

FEve 



-0.660 

0.281 

MD 

-0.160 

0.372 


-0288 

MST 

-0.326 

-0.136 


-0.136 

meanNND 

0.180 

0.372 

-0.224 

-0 317 

evenNND 



-0665 



Table 13.3 Pearson correlations between trait diver¬ 
sity indices and species richness in observed plant assem¬ 
blages and artificial data. 



Plant assemblages 

Artificial data 

a MST 

0.226 

0.399 

CHV 

0.847 

0.803 

FAD 

0.954 

0.972 

MFAD 

0.965 

0.977 

FDiv 

0.181 

0.018 

FDis 

0.723 

0.363 

FDpq\ 

0.905 

0.918 

EDfG2 

0.954 

0.938 

FDq 

0.760 

0.511 

FD\/ar\ 

0.737 

0.428 

FDvan 

0.727 

0.365 

FEve 

0.496 

0.121 

MD 

0.241 

<0.001 

MST 

0.889 

0.904 

meanNND 

-0.613 

-0.696 

evenNND 

0.356 

0.195 


as being due to the tendency for species to be added 
at the periphery of the trait space, rather than inter¬ 
nally within the trait space. While this is a reason¬ 
able idea, the tendency for random assemblages 
to also show this pattern suggests an inherent 
numerical bias. This is one reason why numerous 
authors have suggested comparing the observed 
trait diversity to what would be expected under 
a null model of independent species assortment 
(e.g. Strong et al. 1984; Villeger et al. 2008; Corn- 
well et al. 2006). This is probably most important 
if one wishes to compare trait diversity between 
regions with different species pools or between 
taxa at different trophic levels or those with differ¬ 
ent functional traits. Standardizing to a null model 
has been useful for investigating patterns of com¬ 
munity assembly (see also de Bello et al. (2009)), 
but it is less clear whether such standardizations 
will be helpful for understanding the effects of 
trait diversity on ecosystem processes. Standard¬ 
ized trait diversity can be thought of as the resid¬ 
ual diversity, after accounting for neutral processes. 
If trait diversity, per se, is causally important to 
ecosystems, then we might not expect the null- 
standardized index to matter. Alternatively, the 
null-standardized index may do a better job of 
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catching residual complementarity and actually be 
more predictive of ecosystem processes than the 
unstandardized index. 

When the correlations between the indices were 
conditioned on species richness using partial cor¬ 
relation, they tended to decline in the grassland 
data. This suggests the correlations between the 
indices are partly due to changes in species rich¬ 
ness (Poos et al. 2009). The mean decline in r was 
only 0.16, however, and this suggests that the cor¬ 
relations between the indices are mainly indepen¬ 
dent of species richness. FAD showed the greatest 
declines (mean decline in r = —0.41), which sug¬ 
gests that approximately half of the information in 
FAD is the same as species richness. 

With artificial data, there was no parallel trend, 
as the mean change in correlation was only —0.01. 
This means that the reliability of the indices is more 
strongly dependent on variation in species richness 
when working with real data than when working 
with artificial data. It also supports the conclusions 
of Poos et al. (2009) in which they suggest that 
greater care must taken when comparing the trait 
diversity of communities with similar species rich¬ 
ness because the methods are less reliable within a 
narrow range of diversity. 

13.3 Prospectus 

13.3.1 Recommendations 

No single best index 

I suggest that are four conceptual aspects to trait 
diversity: the occupation of trait space, which is 
measured by the indices associated with trait rich¬ 
ness and trait diversity ( CHV , FDis, FDpc, FD q, 
FD var , MFAD, MST), functional evenness, func¬ 
tional divergence, and the density of species pack¬ 
ing. Because the core indices are strongly con¬ 
cordant, it may not matter which is used, but 
a best practice might be to calculate them all 
and then use the one(s) that are most instruc¬ 
tive. The distinction between trait diversity and 
trait richness appears to be somewhat artificial and 
may be unnecessary when using data from extant 
communities. 

Use care, forethought, and hedge your bets when 
using these indices. There are few, if any, bona 


fide standard practices at this point in time (but 
consider Petchey et al. (2009)). First, there is the 
general question of whether one should use an 
overall multivariate measure of trait diversity that 
encompasses all reasonable traits or whether one 
should calculate trait diversity for each trait sep¬ 
arately. The answer depends on one's goal. If the 
goal is to assess, as thoroughly as possible, the 
amount of trait diversity in a community, then a 
multivariate measure is most appropriate. If, how¬ 
ever, the goal is to understand the constraints on 
community assembly, then a trait-by-trait approach 
may be more enlightening (e.g. Weiher et al. 1998; 
Cavender-Bares et al. 2004). Similarly, if the goal 
is to understand effects on ecosystem processes, 
then choosing a subset of traits that are likely to 
relate to the process of interest in probably the 
best approach (e.g. Leps et al. 2006). Also consider 
using measures of phylogenetic diversity (Chap¬ 
ter 14), as these are sometimes more predictive of 
ecosystem processes than trait-based measures (e.g. 
Cadotte et al. 2009). 

We also need to remember that we are dealing 
with indices of trait and functional diversity, and 
therefore no individual index should be considered 
to be the true definition of trait or functional diver¬ 
sity. Trait diversity is a latent variable, that is, it is a 
conceptual attribute of an assemblage that can only 
be measured indirectly via one or more indices. 
The principal method for dealing with latent vari¬ 
ables is through confirmatory factor analysis within 
a structural equation modelling framework (Grace 
2006). Such an approach is intellectually hon¬ 
est, in that it generalizes trait diversity by utiliz¬ 
ing the common information in a set of parallel 
indicators. 

Consider the indices critically 

FDq is the most intuitively attractive index of func¬ 
tional diversity because of its simplicity, incorpo¬ 
ration of abundance, and parallels with Simpson 
diversity and MPD (see also Leps et al. (2006) and 
Chapter 14). Prior to writing this review, I tried 
to develop my own index from first principles. 
I was somewhat disappointed to have arrived at 
FDq. FDis is similarly intuitive and provides sim¬ 
ilar values. FDvar is also intuitive, but is not truly 
multivariate. MD is conceptually linked to FDq 
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and its phylogenetic analogue, MPD. All of these 
can sometimes produce lower trait diversity when 
intermediate taxa are added. Perhaps this is only 
a theoretical issue because both real and artificial 
data do not show any real problem with declining 
trait diversity with richness. Care should be taken 
if one is providing advice on how to maximize 
diversity; in such cases it is best to follow Pavoine 
et al. (2005a) and use an ultrametric distance 
matrix. 

The Convex hull volume is very attractive for 
estimating the volume of trait space occupied by a 
community. However, as a measure of trait diver¬ 
sity or richness it has three important drawbacks. 
The first drawback is the requirement that a com¬ 
munity have one more species present than it has 
trait dimensions. This means that communities with 
few species will have a convex hull volume of 0 
if the number of traits is greater than a few. One 
could state that the convex hull volume in such 
cases is undefined, but this is more of a seman¬ 
tical dodge than a solution. In geometry, it takes 
at least four points to define a three-dimensional 
volume, and so three points in space do not have 
a volume. So we are left with the fact that if there 
are three traits of interest, a three-species commu¬ 
nity has zero trait diversity. One way to avoid this 
problem is to use only the first two principal com¬ 
ponents of the trait data and thus communities of 
three species could have a two-dimensional convex 
hull area. This still leaves the two-species commu¬ 
nity in limbo with a volume of 0 and it means we 
must ignore some of the trait information. Secondly, 
CHV will tend to scale with increasing richness 
because maximum CHV will occur at 2 r species. 
Lastly, the convex hull volume does not increase if 
a taxon with intermediate trait values is added to 
an assemblage. As a measure of trait volume this 
is reasonable, but as a measure of trait diversity 
some may view this as inappropriate. This ques¬ 
tion seems to be a matter of opinion, more than 
of fact, so we should not privilege one over the 
other. 

MST is the most intuitively attractive of the tree- 
based methods and it also avoids the question of 
clustering method. Even so, FD P g 2 is strictly anal¬ 
ogous to PD. These methods may be the only 
solution for estimating the volume of occupied 


trait space if the trait dimensionality is more than 
two or three and some samples have low species 
richness. 

Be aware of critical decisions (Box 13.1) 


60x13.1 Key questions and steps for 
calculating trait and functional diversity 

A. Species by traits data 

a. Which traits? 

b. How are traits measured? 

i. Continuous, ordinal, binomial, or nominal? 

This has implications for choosing distance 
measures and trait rescaling. 

c. More than one trait? If so, rescale traits and 

collapse correlated traits with PCoA or PCA. 

i. Rescale to relative range (so that trait values 
range from 0 to 1 based on the observed trait 
values), or regional or global range, with z 
transform, or some other appropriate method. 

ii. In general, do not z transform ordinal, 
binomial, or nominal traits. 

iii. Note that PCoA is more generic and can use a 
variety of distance methods. PCA on 
covariance = PCoA if using Euclidean 
distances, PCA on covariance with 
z-transformed data = PCA on correlation. 

B. Samples by species data 

a. How is abundance measured? 


13.3.2 Future directions 

Some enduring questions remain. There is rel¬ 
atively little understanding of how trait diver¬ 
sity varies along real gradients (but see Swenson 
and Enquist (2007), Pausas and Verdu (2008), and 
Filippi-Codaccioni et al. (2009)). Functional ecol¬ 
ogists have largely focused on shifts in commu¬ 
nity mean values (e.g. Cornwell & Ackerly 2009) 
or have used independent contrasts (i.e. consistent 
shifts among sets of congeners, e.g. Cunningham 
et al. (1999)) to assess how functional traits shift 
along gradients. Historically, community ecologists 
have focused on how communities differ from neu¬ 
tral communities, rather than applying gradient 
approaches. If functional diversity is not perfectly 
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correlated with species richness, is it under the 
same sorts of controls as species richness? If it is 
not, then what are the controls on it and how do we 
understand the interrelationships between species 
richness and functional diversity? 

One way to approach the question of how trait 
diversity is related to species richness is to recog¬ 
nize that the volume of occupied trait space is a 
simple function of the number of taxa times the 
average amount of trait space occupied per taxon: 

Richness tm i t rs Richness species x meanNND 

This approach could be useful because we can 
explicitly link MST to richness by 


MST = meanNND (S — 1) + redundancy 


std. redundancy 


MST -{meanNND (S- 1)) 
MST 


where the redundancy within a community would 
be the amount by which average branch length in 
the MST exceeds the meanNND. If each .taxon is 
unique and contributes equally to the MST, then 
there is no redundancy. As taxa have closer nearest 
neighbours relative to the average branch length of 
the MST, then there is increasing redundancy. At 
the limit, where meanNND = 0, then all the taxa are 
redundant and standard redundancy is 1 (i.e. all of 
the MST is from redundant taxa). 

Does functional diversity affect ecosystem 
processes? While there is some good evidence 
that ecosystem processes are indeed promoted by 
functional diversity (e.g. Petchey & Gaston 2004; 
Diaz et al. 2007; Cadotte et al. 2009), a general 
lack of quality functional trait data has hampered 
progress in this area, as many studies simply used 
the number of functional types in lieu of traits. 

How do these measures vary as a function of 
realistic changes in functional diversity, that is, if 
we double trait diversity, do we double the index of 
trait diversity? Do we need some kind of 'effective 
numbers' transforms for the indices that use abun¬ 
dance information? 

How can phylogenetic and functional app¬ 
roaches be combined to increase understanding, for 
example consider Cavender-Bares et al. (2009) and 
Pavoine et al. (2009)? 


13.4 Key points 

1. Many of the indices have shortcomings because 
of their intrinsic relationships with species rich¬ 
ness, their capacity to decrease when inter¬ 
mediate taxa are added, and their require¬ 
ments for three or more taxa, for the num¬ 
ber of taxa per trait, for needing non-zero trait 
values, or for less than clear standard meth¬ 
ods. In addition, the indices represent at least 
four components of diversity, therefore no best 
index for calculating trait diversity can be sin¬ 
gled out. Multiple indices should be used and 
compared. 

2. A subset of the indices of trait diversity was 
strongly correlated, even after accounting for 
variation in species richness, therefore many of 
the indices may be roughly equivalent indicators 
of trait diversity, especially if species richness 
and trait diversity have notable variation among 
the samples. This also suggests that trait diver¬ 
sity and trait richness may be largely a singular 
latent variable that can be measured with good 
reliability. 

3. Several indices measure distinctive aspects of 
trait diversity. These include: (a) functional 
divergence (FDiv), (b) functional evenness (FEve 
and evenNND), and (c) nearest-neighbour met¬ 
rics (aMsr and meanNND). 

4. Like other diversity indices, trait diversity is sen¬ 
sitive to a variety of methodological choices and 
there is no sound way to compare trait diversi¬ 
ties except when produced using absolutely par¬ 
allel methods. 

5. There are great opportunities for the further 
development of trait diversity indices, for under¬ 
standing the environmental controls on trait 
diversity, and for understanding the effects of 
trait diversity on ecosystems. These questions 
remain open. 
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CHAPTER 14 


Measuring phylogenetic biodiversity 

Mark Vellend, William K. Cornwell, Karen Magnuson-Ford, 
and Arne 0. Mooers 


14.1 Introduction 

14.1.1 Overview 

Biodiversity has been described as the 'biology of 
numbers and difference' (Gaston 1996). Because 
species are different from one another, traditional 
metrics of biodiversity such as species richness 
or evenness increase when there are more species 
or when abundance is more equally apportioned 
among these species (see Chapters 4 and 5). Not 
only are species different from one another, the 
magnitude of these differences varies tremendously 
depending on the set of species in question. Con¬ 
sider two hypothetical islands, each with only two 
species of vertebrate animals in equal abundance: 
two birds in one case and a bird plus a mammal in 
the other. Both islands have species richness = 2 (for 
vertebrates) and the same maximal value of species 
evenness. However, our intuition tells us that a bird 
plus a mammal represents more biodiversity than 
does two birds (Purvis & Hector 2000). Metrics of 
phylogenetic diversity quantify the difference. 

Differences among species can be characterized 
by measuring any number of traits, such as body 
size and shape, dietary requirements, physiologi¬ 
cal tolerance of various stressors, etc. (see Chap¬ 
ter 17). Particular traits may be of special interest 
to a researcher for various reasons, such as their 
hypothesized role in mediating species interactions 
(e.g. beak size in birds) or their importance in 
tolerating different environmental conditions (e.g. 
leaf thickness in plants). However, the degree of 
similarity or difference among species will depend 
strongly on the choice of traits measured, and many 
traits are only applicable to particular groups of 
organisms (e.g. photosynthetic rate in plants). A 


far more general method for quantifying diversity 
among species is an assessment of the species' evo¬ 
lutionary relationships, in the form of either tax¬ 
onomy or a phylogeny. Modern phylogenies are 
derived from DNA-sequence data, which can be 
acquired for all organisms on the tree of life. The 
phylogenetic distance between two species is an 
estimate of the amount of time since the most recent 
common ancestor of both species, in other words 
the time that each has evolved independently of 
the other. While individual traits may show 'con¬ 
vergence', that is, similar values evolving in dis¬ 
tantly related lineages, the phylogenetic distance 
represents a proxy for the magnitude of pheno¬ 
typic differences (across a large number of traits) 
expected between any two species (Cavender-Bares 
et al. 2009). 

Biologists have been interested in the phyloge¬ 
netic component of biodiversity for two main rea¬ 
sons: (i) to explicitly incorporate species differences 
(via a common currency applicable to all taxa from 
bacteria to primates), rather than just species num¬ 
bers, into conservation prioritization and (ii) to 
yield insights into the structure of ecological com¬ 
munities. In the first case, recognizing that difficult 
prioritization decisions need to be made concerning 
the investment of limited resources for conserva¬ 
tion, it has been argued that the aim should not 
be just to protect the greatest number of species 
possible, but to protect sets of species that are most 
taxonomically distinct or that represent the great¬ 
est possible variety of biological features (Vane- 
Wright et al. 1991; Faith 1992; Mooers et al. 2005; 
Isaac et al. 2007). To this end, considerable effort 
has been aimed at quantifying the evolutionary 
distinctness (and therefore conservation value) of 
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individual species, or the 'phylogenetic diversity' of 
a group of species (Vane-Wright et al. 1991; Faith 
1992; Altschul & Lipman 1990; Nixon & Wheeler 
1992; Pavoine et al. 2005b; Redding and Mooers 
2006). 

In terms of analyses in community ecology, the 
incorporation of phylogenetic information has a rel¬ 
atively long history. Darwin (1859) first hypothe¬ 
sized that competition should be strongest between 
close relatives (e.g. congeners), leading subsequent 
researchers to explore ratios of species-to-genus 
numbers (or genus-to-family, etc.) as potentially 
indicative of the role of competition in structur¬ 
ing ecological communities (e.g, Elton 1946). More 
recently, the same conceptual question has been 
approached using modern phylogenies, which con¬ 
tain far more information on evolutionary relation¬ 
ships than taxonomic categories (reviewed in Webb 
et al. (2002), Vamosi et al. (2009), and Cavender- 
Bares et al. (2009)). If indeed close relatives compete 
most strongly, local communities should contain 
species that are relatively distantly related to one 
another. Alternatively, species membership in a 
local community might be most constrained by tol¬ 
erance of abiotic environmental conditions, and if 
close relatives share similar tolerances, local com¬ 
munities should contain species that are relatively 
closely related to one another. To test these hypothe¬ 
ses, researchers have employed some of the phy¬ 
logenetic diversity metrics from the conservation 
literature and also introduced some additional met¬ 
rics of their own (Webb et al. 2008). 

As applied to issues in both conservation biol¬ 
ogy and community ecology, phylogenetic diver¬ 
sity has been a topic of tremendous interest in the 
recent literature. As such, a large number of met¬ 
rics to quantify phylogenetic diversity have been 
introduced, and for some subsets of these metrics 
analyses have been done to assess their redundancy 
or the degree to which they meet certain pre-set 
criteria (e.g. Pavoine et al. 2005b; Kraft et al. 2007; 
Hardy 2008; Schweiger et al. 2008). In this chap¬ 
ter, we aim to provide guidance to researchers and 
practitioners for selecting particular metrics and 
for interpreting published results based on differ¬ 
ent metrics. After providing some important def¬ 
initions (see Box 14.1) and a conceptual overview, 
we will first offer a categorization of metrics found 


in the literature, according to the functional form 
of the calculation and the nature of the input data 
(e.g. species presence-absence vs abundance data). 
Next we report results of simulation analyses, in 
which artificial communities were created under 
different assumptions about the processes by which 
phylogenies arise, and by which membership and 
abundance in local communities are determined. 
The goals here were to assess quantitative relation¬ 
ships among different metrics (e.g. which behave 
similarly?) and to assess the sensitivity of different 
metrics to underlying evolutionary and ecological 
processes. We will then discuss the qualitative and 
quantitative relationships among metrics and how 
researchers can go about choosing metrics for dif¬ 
ferent purposes. 

14.1.2 Approaching the study of phylogenetic 
diversity 

The choice of metrics of phylogenetic diversity in 
empirical studies is entirely under the control of 
the researcher, but will be influenced by three key 
aspects of a particular system, which are typically 
not (or only partially) under a researcher's control: 
underlying processes, related patterns (other than 
phylogenetic diversity), and data constraints. 

A variety of evolutionary and ecological processes 
influence the values of phylogenetic diversity met¬ 
rics, either indirectly or directly. Macroevolutionary 
processes will create patterns that matter a great 
deal for phylogenetic community structure and the 
choice of metric. For example, the extent to which 
speciation is 'ecological' (i.e. driven by divergent 
selection) will affect the shape of the phylogeny 
and the phylogenetic conservatism of different traits. 
Both of these will in turn affect the extent to which 
species coexistence may be influenced by relatedness 
(Mooers & Heard 1997; Kembel & Hubbell 2006; 
Kraft et al. 2007; McPeek 2007, 2008). Ecological 
processes influencing community assembly, such as 
environmental constraints on fitness or competition 
for resources, will also influence phylogenetic diver¬ 
sity metrics (Webb et al. 2002). 

Some empirical patterns, including the distrib¬ 
ution of species numbers and abundances among 
sites, and the degree of balance in a phyloge¬ 
netic tree (see Box 14.1), may influence the range of 
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Box 14.1 Definitions of attributes of phylogenetic trees 


A rooted phylogenetic tree summarizes hypothesized 
evolutionary relationships among species or other biological 
units such as lineages within species. Phylogenetic trees 
can be estimated using a variety of methods (see 
Felsenstein 2004), the details of which are beyond the 
scope of this chapter. For the purposes of this discussion 
we will assume that the tips (sometimes referred to as 
'leaves') of the tree represent species (see Figure Box 14.1). 
A node represents the most recent common ancestor of all 
species descending from that point in the tree (i.e. where 
branches split) and the root node (often referred to simply 
as the root) is a single point from which it has been inferred 
that all species descend, thus giving the entire tree 
temporal directionality. The simplest type of phylogenetic 
tree represents only the topology, with no information on 
the lengths of branches connecting the nodes (e.g, 
taxonomies based on morphological data and some types 
of molecular data). We refer to such trees as node-based 
trees. 

A branch in the phylogenetic tree (also referred to as an 
'edge' in graph theory), and its associated branch length, 
may represent the accumulation of evolutionary change, in 
which case the tips may not line up because the rate of 
evolutionary change is not constant across all branches. 
Alternatively, branch lengths may be scaled to represent the 


passage of time, such that all tips line up in the same place. 
Each of these two types of phylogenetic trees are 
considered additive, and the latter type is additionally 
called ultrametric (all distances from root to tip are the 
same). We refer to trees with quantitative branch lengths 
as distance-based trees. 

Nodes are usually bifurcating, such that lineages split 
into two. Polytomies are nodes where this is not the case, 
and the lineage splits into three or more. This arises most 
commonly due to data limitations. The degree of a node is 
the number of branches, both ancestral and descendant, 
connected to that node (three for a bifurcating node). 

The shape of phylogenetic trees can be characterized by 
two key properties: their degree of balance and the degree 
to which divergence events happened predominantly early 
(divergence decelerating) or late (divergence accelerating) 
during the evolution of the group (the latter is characterized 
by the y statistic). In a perfectly balanced tree, all tips 
are separated from the root by the same number of nodes, 
which is equivalent to saying that all lineages bifurcate the 
same number of times. In a perfectly imbalanced tree, 
one lineage descending from each node connects directly to 
a tip with no further bifurcations. In a tree with low y 
divergence events are concentrated early during the 
evolution of the group, and vice versa. 


(a) The components of a phylogenetic tree 



(b) Properties of phylogenetic trees 




Node-based tree Additive, 

(branch lengths all equal) non-ultrametric tree 



ultrametric tree 




Imbalanced tree 
accelerating 
diversification 



Imbalanced tree Balanced tree 
decelerating accelerating 
diversification diversification 



Balanced tree 
decelerating 
diversification 


Figure Box 14.1 The components of a phylogenetic tree (a), and different representations and shapes of trees illustrating their properties (b). 


possible values different metrics can take. The same 
factors may affect the degree to which particular 
metrics are distinct from others (e.g. Redding et al. 
2008; Schweiger et al. 2008). These 'other' patterns 
may be influenced by some of the same processes 
as phylogenetic diversity, but they can be quanti¬ 
fied independently and may on their own influence 


phylogenetic diversity metrics regardless of what 
processes created them. 

Finally, there may be data constraints. Specifi¬ 
cally, the nature of the phylogenetic information 
may only allow representation of the topology of 
a phylogenetic tree, or it may allow estimation of 
branch lengths connecting nodes in the tree (see 
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Box 14.1). In addition, data might be available only 
on the presence or absence of species in particu¬ 
lar places without relative abundances. Unlike the 
processes and patterns mentioned under the first 
two considerations, these constraints are, in theory, 
under the control of the researcher, but, in practice, 
many studies are based on existing data, which may 
impose such constraints. 

In this chapter we do not discuss the first con¬ 
sideration of the link between process and pattern 
(e.g. why do locally co-occurring species represent 
a non-random subset of a regional phylogeny?), 
which has been thoroughly reviewed elsewhere 
(Webb et al. 2002; McPeek 2008; Vamosi et al. 2009; 
Cavender-Bares et al. 2009). Instead we focus on the 
practical issue of what quantitative information is 
reflected in different metrics and on the latter con¬ 
siderations of how properties of the existing data 
may influence the choice or interpretation of dif¬ 
ferent metrics. We restrict our attention to calcula¬ 
tions of phylogenetic diversity within focal sets of 
species or local communities, rather than partition¬ 
ing diversity among hierarchical levels (e.g. a, (3, y; 
Graham & Fine 2008). 


14.2 State of the field 

All empirical studies of phylogenetic diversity 
begin with an estimated phylogeny for the group 
of species of interest. The scope of this phylogeny 
varies—it may include all known species across 
some broader region (e.g. all birds of South America 
or of the world) or only those species present in 
particular surveyed areas (e.g. the birds found in 
a survey of five tropical forest plots). For conve¬ 
nience, we refer to these two options as a 'regional' 
phylogeny and a 'local' phylogeny, respectively. It 
is then typically of interest to quantify the relative 
magnitude of phylogenetic diversity among focal 
subsets of species, which may be defined as those 
co-occurring in a local area (e.g. the birds in one 
tropical forest plot) or a candidate set of species 
proposed for special conservation attention (e.g. the 
bird species listed as endangered in Brazil). We refer 
to the portion of the regional or local phylogeny 
that includes only the focal subset of species as the 
'subset' phylogeny. 


Two qualitatively different types of metrics of 
phylogenetic diversity have been developed. We 
refer to type I metrics as those that begin by cal¬ 
culating a distinctness score for all species in a 
regional phylogeny and then calculating some func¬ 
tion of these scores (typically the sum) for partic¬ 
ular focal subsets of species to yield a metric of 
phylogenetic diversity. Type II metrics start with a 
local phylogeny (or possibly a regional phylogeny), 
and for a focal subset of species they depend only 
on properties of the subset phylogeny. Type I met¬ 
rics have been used largely in conservation biology, 
while community ecologists have mostly employed 
type II metrics, but some have been used in both 
fields. For type I metrics, the motivation behind first 
calculating fixed individual species scores, rather 
than effectively allowing these to depend on the 
focal species set (as in type II metrics), is to per¬ 
mit individual species to be ranked in a way that 
does not depend on the status of other species (e.g. 
whether or not they are already protected). 

Type I metrics of phylogenetic diversity are calcu¬ 
lated in two stages. First, an index of distinctness is 
calculated for each species and second, these values 
are entered into a separate function to summarize 
the scores for a focal subset of species. In stage 
one, five different indices of species' distinctness 
have been used in the literature: taxonomic distinct¬ 
ness (TD), species originality (SO), pendant edge 
(PE), species evolutionary history (SEH), and origi¬ 
nality of species within a set (OSS; see Table 14.1). 
The first two (TD, SO) are based only on node- 
based phylogenetic trees, and the other three (PE, 
SEH, OSS) are based on distance-based trees. In 
stage two, the most common function is simply 
the sum, which is obviously intended to incorpo¬ 
rate species richness into the metric (all else being 
equal, more species represent more phylogenetic 
diversity). With species' abundance data it is also 
possible to apply a procedure similar to rarefaction 
(see Chapter 4) to yield an index that reflects the 
expected sum in a sample of x individuals chosen 
randomly from the community (Ricotta 2004), but 
this is seldom used in the literature. In theory it is 
also possible to calculate the mean of distinctness 
values, although this is also seldom done. 

At first glance, the number of different type II 
metrics in the literature appears rather large, but the 
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Table 14.1 Indices of species distinctness for use in type I metrics of phylogenetic diversity. 


Index 

Description 

Reference 

Taxonomic distinctness (TD) 

Reciprocal of number of nodes between species and root of tree 
(standardized by dividing by the sum of these scores across 
species and multiplying by 100) 

Vane-Wright etal. (1991) 


Modification: To account for polytomies, count number of 
descendants at each node rather than number of nodes 

May (1990) 

Species originality (SO) 

Assign each node in a tree a value of 1 if more species descend 
from that node than its sister node, and 0 otherwise; sum the 
values at the nodes between a species and the root; smaller 
values indicate greater distinctness 

Nixon & Wheeler (1992) 


Modification: As above, but assign each node a value equal to the 
number of species that descend from that node; referred to as 
weighted species originality (WSO) 

Nixon & Wheeler (1992) 

Pendant edge (PE) 

The length of the branch connecting a species to the rest of the 
regional tree 

Altschul & Lipman (1990) 

Species evolutionary history (SEH) 

The portion of a phylogenetic tree attributable to a species; shared 
branches are apportioned equally among descendant lineages 
('equal splits'), for example in a tree with no polytomies, the 
portion of a branch that is assigned to a species that is n nodes 
away from that branch is equal to 1 /2 n . 

Redding & Mooers (2006) 


Modification: As above, but shared branches are apportioned 
equally among descendant species ('fair proportions', 

SEHJait): referred to as species evolutionary distinctiveness 

Redding et al. (2008) 

Originality of species within a set (OSS) 

Values for each species that 'maximize the expected dissimilarity 
between two species randomly drawn from the set' 

There is no simpler way of describing this metric 

Pavoine et al. (2005b) 


Capitalized short forms are used in the text. Equations are not shown because in most cases either verbal descriptions are very simple (TD, PE) or it is not possible to 
write an equation that clarifies any further the meaning of the metric (SO, OSS). Original publications can be consulted for details. 


distinction between many of these is based only on 
the nature of the input data rather than the equa¬ 
tion into which the data enter. Two data charac¬ 
teristics in particular allow different sets of metrics 
to be aggregated. First is the nature of the phylo¬ 
genetic tree (see Box 14.1). If only the tree topol¬ 
ogy has been estimated, 'distances' within the tree 
are quantified simply by counting the nodes along 
the path of interest (e.g. between two tips). Quan¬ 
titative branch lengths allow distances to take a 
continuous range of values. Metrics based on count¬ 
ing nodes are effectively special cases of distance- 
based metrics in which all branch lengths are set 
equal to one. Second is the nature of species abun¬ 
dance data. There may be quantitative estimates of 
species' abundances, or only of species' presence 
or absence; metrics based on the latter are special 


cases of the former in which each species has the 
same abundance. Four distinct kinds of type II met¬ 
ric have been proposed (described in Table 14.2). 
Each of these can incorporate abundance data if 
available, and may also be expressed as deviations 
from expected values based on null models. 

14.2.1 Null models 

Any of the metrics we have described thus far can 
be expressed in their raw form or as deviations 
from an expectation derived from a null model. 
In practice, null models have not been employed 
in the conservation literature, but are commonplace 
in the community ecology literature. In addition, 
null models are almost always used only with 
presence-absence data at the smallest scale. The 
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Table 14.2 Type II metrics of phylogenetic diversity. 


Metric 

Presence-absence (PA) 

version 

Abundance-weighted 
(AW) version 

Equation 

References 

Phylogenetic 

Sum of all branch lengths in the 

For the subset tree, the number of 

£ LjAj 

PA: Faith (1992) 

diversity (PD) 

portion of a phylogenetic tree 
connecting the focal set of 
species (PD, PDn) 

branches multiplied by the 
weighted mean branch length, 
with weights equal to the average 
abundance of species sharing 
that branch* ( PDaw, PDnaw) 

B x -L — 

0 

AW: Barker (2002) 

Mean phylogenetic 

Mean phylogenetic distance 

Mean phylogenetic distance 

1111m<n dmn a m a n 

PA: Webb (2000) 

distance (MPD)' 1 

between each pair of species 
in the focal set (MPD, MPDn) 

between pairs of individuals (or 
other units of abundance), 
excluding same-species pairs 
(, MPDaw, MPDnaw) 

Modification: Mean phylogenetic 
distance between pairs of 
individuals (or other units of 
abundance), including 
same-species pairs 

H Hm<n a m a n 

HT.dmn a m a n 
m n 

T.T. a m a n 
m n 

AW: Warwick & Clarke 

(1995) 

Rao (1982); Warwick & 
Clarke (1995) 

Sum of phylogenetic 

Sum of phylogenetic distances 

Abundance-weighted MPD 

(a^i)x 

PA: Crazier (1997); 

distances (SPD )* 

between each pair of species 

multiplied by the number of 

Y.Y.m<ndmn a m a n 

Helmus et al. (2007) 


Equivalent to MPD multiplied 
by the number of species pairs 

species pairs 

E Hm-.n a m a n 

AW: none 

Mean nearest 

Mean phylogenetic distance 

Weighted mean phylogenetic 

S 

Y'. min (dmn) a m 
m 

PA: Webb (2000) 

neighbour 
distance (MNND) 

from each species to its 
closest relative in the focal 
species set (MNND, MNNDn) 

distance from each species to its 
closest relative, with weights 
equal to species' abundance 
(, MNNDaw, MNNDnaw) 

AW: none 


Short forms correspond to labels in PCA plots (see Fig. 14.1); n = node-based metric. B, number of branches in tree; length of branch <; A-„ average abundance 
(measured in any units) of species that share branch /; d mn , phylogenetic distance between species m and n; a m , abundance of species m (with presence-absence data 
all species have the same abundance); 5, number of species in the focal set; aw, abundance-weighted metric. 

•This is our interpretation of how Barker's iterative method (for unrooted trees) would be applied most simply to a rooted tree. 

t For ultrametric trees, the presence-absence version of MPD is equivalent to twice the phylogenetic species variability (PSV) metric of Helmus et al. (2007), which was 
derived as the expected variance among species in a neutrally evolved trait. The phylogenetic species evenness (PSE) metric of Helmus et al. is a rescaled version of 
abundance-weighted MPD (both PSV and PSE are calculated using a phylogenetic tree with branch lengths scaled so that all root-to-tip distances are 1). 

* Helmus et al. (2007) calculate phylogenetic species richness (PSP) by multiplying MPD by the number of species, rather than the number of species pairs, although 
the two options scale monotonically with one another. 


principle goal in constructing a null model is to ask 
what distribution of values is expected for focal sets 
of species in which there is no phylogenetic struc¬ 
ture, but in which all other properties (e.g. species 
richness) are the same. 

Two main classes of null models have been 
used. First, for a focal set of S species, one can 
take repeated draws of S species chosen randomly 
from the regional phylogenetic tree. In essence, this 
is like shuffling the species identities randomly 
among tips in the phylogenetic tree. Alternatively, 


with a species-by-site data matrix, one can repeat¬ 
edly shuffle the species' presences among sites to 
essentially randomize which species co-occur while 
retaining each species' frequency across sites. It is 
also possible to place a variety of additional con¬ 
straints on the shuffling procedure, such as retain¬ 
ing both the species richness of each site and 
the frequency of each species (Gotelli & Graves 
1996). 

Once a null model has been constructed, the met¬ 
rics of interest are calculated for each simulated set 
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of species, and observed values can be expressed 
either as the number of standard deviations away 
from the expected mean (Webb et al. 2008) or the 
probability of obtaining a lower (or higher) value 
than that observed solely by chance. Since in many 
cases the distribution of values will depend on 
species richness, metrics expressed in this way are 
usually not monotonic transformations of the initial 
values. For example, phylogenetic diversity (PD) 
might be equivalent in two different communities 
with different numbers of species, but the one with 
fewer species will have a lower null expectation, 
and therefore the re-expressed value will be higher 
in the community with fewer species. 

14.2.2 Simulation analyses 

Different metrics can be compared and evaluated 
both qualitatively and quantitatively. In terms of 
qualitative comparisons, a researcher's choice of 
metric may depend on what information they want 
reflected in the metric (e.g. species' closest rela¬ 
tives or their full set of relationships), rather than 
any predetermined quantitative criteria (e.g. sen¬ 
sitivity to a non-random community assembly). 
Both qualitative and quantitative considerations are 
addressed in the Prospectus section, and to make 
quantitative comparisons we conducted a set of 
simulation analyses. The simulations were aimed at 
addressing the following three questions: (1) which 
metrics are redundant with one another, (2) how 
do metrics compare with respect to their sensitiv¬ 
ity to different non-random community assembly 
processes, and (3) how does tree shape influence the 
answers to questions 1 and 2? 

We simulated sets of species ('communities') 
under a number of scenarios that vary in the follow¬ 
ing respects: the degree of balance in the regional 
phylogenetic tree, the change or lack thereof in 
diversification rate through time in the regional 
phylogeny (decelerating or accelerating, according 
to the 'y' parameter, sensu Pybus & Harvey 2000), 
the nature of (non)randomness in the commu¬ 
nity assembly process, and the number of species 
in the community. All regional phylogenetic trees 
were rooted, ultrametric, contained 100 species, and 
were created in five ways: (1) by a pure birth 
('Yule') process (constant rate of diversification 


through time and across all lineages), (2) entirely 
imbalanced with decelerating diversification, (3) 
entirely imbalanced with accelerating diversifica¬ 
tion, (4) entirely balanced, decelerating diversifica¬ 
tion, and (5) entirely balanced, accelerating diversi¬ 
fication. In order to span the range of empirical val¬ 
ues of change in diversification rate, we rescaled the 
branch length of both the balanced and imbalanced 
trees to match the most extreme observations (mea¬ 
sured via the y statistic; see McPeek 2008). For each 
tree, we simulated local communities either by ran¬ 
domly selecting species or via algorithms that cre¬ 
ated phylogenetically clustered or over-dispersed 
sets of species, with each of the latter done in two 
ways. For phylogenetic over-dispersion, the first 
species was chosen randomly and each subsequent 
species was chosen with a probability proportional 
to the square root of its average phylogenetic dis¬ 
tance to either species (method 1) or individuals 
(method 2) already in the community. Using the 
square root of phylogenetic distances approximates 
the expected trait difference based on a Brown¬ 
ian motion model of trait evolution (Felsenstein 
1985). For method 2, the mean phylogenetic dis¬ 
tances were calculated by weighting the distance 
to each already-chosen species by the abundance 
of that species. Phylogenetically clustered sets of 
species were created in an identical way, except that 
the mean phylogenetic distance to already-chosen 
species or individuals determined the probability 
of a species not being chosen. Abundances were 
either considered to be equivalent across species 
(presence-absence based metrics) or drawn from a 
log-normal distribution and assigned from highest 
to lowest in the sequence that species were chosen. 
Sets of species were selected with species richness 
of 10, 20, 30, or 40. For each of the five tree types, 
we simulated 500 sets of species for each of the 
four levels of species richness in each of the five 
community assembly processes, for a total of 20 000 
sets of species. 

We focused our analyses on two groups of 
metrics, which correspond to those of interest to 
conservation biologists and those of interest to 
community ecologists. The 'conservation' group 
consisted of all type I metrics (sums of species 
distinctness values), plus Faith's (1992) phyloge¬ 
netic diversity (PD) because type I metrics are often 
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Table 14.3 Correlations among sums of type I metrics, as well as species richness (SR) and phylogenetic 
diversity (PD), for 10000 simulated species sets selected from a pure birth phylogenetic tree. Short forms 
correspond to those in Table 14.1. 


TD 

0.92 


SO 

0.95 

0.91 

mo 

0.94 

0.92 

PE 

0.93 

0.83 

SEH 

0.95 

0.96 

SEHJair 

0.97 

0.96 

OSS 

0.73 

0.91 

PD 

0.87 

0.90 


SR 

TD 


0.98 



0.91 

0.87 


0.94 

0.95 

0.86 

0.95 

0.95 

0.87 

0.76 

0.79 

0.62 

0.85 

0.86 

0.78 

SO 

WSO 

PE 


1.00 



0.88 

0.87 


0.94 

0.94 

0.87 

SEH 

SEHJair 

OSS 


evaluated with respect to their ability to capture PD 
(e.g. Redding et al. 2008). Because community ecol¬ 
ogists are typically interested in assessing non- 
random phylogenetic structure in sets of species, 
rather than phylogenetic diversity per se, the 'com¬ 
munity ecology' group of metrics included only the 
null-model corrected versions of each type II met¬ 
ric. These were calculated as the number of stan¬ 
dard deviations from the mean across randomly 
assembled communities (described above). In addi¬ 
tion, since the value of the summed phylogenetic 
distance (SPD) and the modified mean phyloge¬ 
netic distance (MPD; see Table 14.2) differ from 
unmodified MPD only due to species richness or 
the species abundance distribution, both of which 
are accounted for in the null model, the null-model 
corrected versions of these are redundant. Thus, 
we focused on the four versions of PD, MPD, and 
MNND, depending on the combination of node- 
vs distance-based trees and presence-absence vs 
abundance data, for a total of 12 metrics in the com¬ 
munity ecology group. To calculate these metrics, 
we drew on publicly available functions in the ape 
(v2.3), picante (v0.7), and ade4 (vl.4) libraries for R. 
New phylogenetic functions coded specifically for 
this analysis are available within the Picante library 
for R (http://picante.r-forge.r-project.org/). 

14.2.3 Simulation results 

Correlations among type I metrics, and between 
these metrics and phylogenetic diversity (PD), were 
typically very high. Principal component analy¬ 
ses for each tree type revealed that the variation 


among these metrics is essentially one dimen¬ 
sional, with >90% of the variance explained by the 
first principal component in all cases (figures not 
shown). Table 14.3 shows the correlation structure 
among these metrics (as well as species richness) 
for the pure birth tree, where r > 0.7 for all pair¬ 
wise comparisons except one (for which r > 0.6). 
Correlations among metrics were even higher for 
all other tree types, with the one exception aris¬ 
ing for balanced trees with accelerating diversifica¬ 
tion. In this case, all species receive highly similar 
distinctness scores (see example of this tree type 
in Box 14.1), such that type I sums give virtually 
identical results to each other and to species rich¬ 
ness (r > 0.99). However, PD varies considerably 
depending on how many basal clades are included 
in a given sample of species, such that correla¬ 
tions between PD and the type I sums were rel¬ 
atively low (r ~ 0.45). For pure birth or balanced 
trees, correlations of these metrics with species 
richness were high (mean r > 0.9), whereas for 
imbalanced trees correlations with species richness 
were considerably lower (mean r = 0.62 for deceler¬ 
ating diversification, mean r = 0.45 for accelerating 
diversification). 

Type II metrics also gave broadly similar results 
to one another for pure birth or balanced trees, but 
not for imbalanced trees (Table 14.4). For pure birth 
and imbalanced trees, correlations tended to be 
higher within distance-based or node-based groups 
of metrics than across these groups (Fig. 14.1). 
For imbalanced trees the correlations across the 
two groups were actually negative (Fig. 14.1), a 
result that can be explained as follows. Commu- 
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Table 14.4 Correlations of distance-based (0) vs. node-based ( N ), and presence-absence (PA) vs. abundance-weighted (AW) 
versions of each of the three main type II metrics, in all cases expressed as deviations from a null model. 



Presence-absence 

version 

Abundance- 
weighted version 

Distances 

Nodes 


D vs. N 

D vs. N 

PA vs. AW 

PA vs. AW 

Metric 






PD 

0.79 

0.68 

0.89 

0.85 

Pure birth tree 

MPD 

0.69 

0.80 

0.91 

0.79 


MNND 

0.84 

0.83 

0.88 

0.89 


PD 

-0.75 

-0.55 

0.83 

0.35 

Imbalanced tree 

MPD 

-0.25 

-0.15 

0.95 

0.84 

Decelerating diversification 

MNND 

-0.10 

0.02 

0.96 

0.53 


PD 

-0.68 

-0.45 

0.86 

0.40 

Imbalanced tree 

MPD 

-0.71 

-0.69 

0.91 

0.88 

Accelerating diversification 

MNND 

-0.26 

0.05 

0,93 

0.56 


PD 

0.99 

0.96 

0.92 

0.96 

Balanced tree 

MPD 

0.98 

0.98 

0.95 

0.97 

Decelerating diversification 

MNND 

0.99 

0.98 

0.95 

0.95 


PD 

0.96 

0.93 

0.97 

0.95 

Balanced tree 

MPD 

0.96 

0.97 

0.91 

0.95 

Accelerating diversification 

MNND 

0.98 

0.96 

0.92 

0.93 



nity assembly was based on a tree with quantitative 
distances along branches, such that over-dispersed 
sets of species contain many of the species that 
connect to the rest of the tree via long branches 
(early-divergent or 'basal' species). Although these 
species are relatively distantly related to most oth¬ 
ers, they are separated from one another by rela¬ 
tively few nodes, such that over-dispersed sets of 
species actually appear clustered on a node-based 
tree. For clustered community assembly, if the first 
species chosen (randomly) is relatively basal, then 
most other species will be equidistant from the first 
and therefore have similar probabilities of joining 
the community next. For the resulting set of species, 
the value of node-based metrics will actually be 
higher than when over-dispersed species were cho¬ 
sen during community assembly. This creates the 
observed negative correlations. Within the distance- 
based type II metrics (null-model corrected), varia¬ 
tion among sets of species was largely one dimen¬ 
sional, with >85% of the variation in each PC A 
explained by the first axis (Fig. 14.1). 

Sensitivity to non-random community assembly 
processes varied considerably among metrics and 


among types of phylogenetic trees (Fig. 14.2), with 
several notable patterns. First, for imbalanced trees, 
node-based metrics are either insensitive to non- 
random community assembly or they actually devi¬ 
ate from the null model in the opposite direction 
than expected (Fig. 14.2b and 14.2c). In particu¬ 
lar, over-dispersed communities had lower node- 
based metric values than expected, for the reasons 
explained above with respect to negative correla¬ 
tions between distance-based and node-based met¬ 
rics on imbalanced trees. 

The amount of variance among species in their 
phylogenetic distinctness mediated the effect of tree 
shape on the sensitivity of different metrics. We 
focus here only on distance-based metrics (the first 
and third panels from the left in Fig. 14.2). For bal¬ 
anced trees (Fig. 14.2d and 14.2e) there are rela¬ 
tively few basal clades, each with many species, 
such that overall, variance among species in their 
phylogenetic distinctness is small. In other words, 
on balanced trees most species in the phylogeny 
have very similar distribution of relatedness to 
other species on the tree and also very similar 
type I scores. In this evolutionary context, clustered 
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assembly leads to sets of species that may be con¬ 
centrated in only one of a few clades. In con¬ 
trast, randomly assembled communities (which 
contained at least 10 species) will probably have 
members from across many or all deep clades, lead¬ 
ing to large deviations from random expectation 
in metrics for clustered communities. In contrast, 
over-dispersed communities have representatives 
from many clades just as randomly assembled com¬ 
munities often do, with the non-random selection 
of species within clades only capable of creating 
relatively small deviations from random expecta¬ 
tion. In imbalanced trees with accelerating diver¬ 
sification (Fig. 14.2c), a relatively small proportion 
of species are far more phylogenetically distinct 
than most others, leading to large deviations of 
over-dispersed communities (which include these 
species) from random sets of species. With the 
first species in each simulation chosen randomly, 
even clustered communities will sometimes include 
basal species, and after the first species is cho¬ 
sen all more derived species are equally related 
and have an equal chance of being selected, such 
that clustered communities deviate from random 
to a lesser extent than over-dispersed communi¬ 
ties despite symmetry in the ecological community 
assembly processes. In pure birth trees there is suffi¬ 
ciently little variation among species in distinctness 
that over-dispersion is easier to detect than cluster¬ 
ing (Fig. 14.2a), and in imbalanced trees with decel¬ 
erating diversification over-dispersed and clustered 
communities show approximately equal deviations 
from the null expectation. 

The final point to make concerning Fig. 14.2 is 
that in some cases the sensitivity to non-random 
community assembly varies among metrics. Again 
focusing only on distance-based metrics, for pure 
birth or balanced trees, MPD was more sensitive 
than PD or MNND, while for imbalanced trees the 
differences among metrics were relatively small. 


Figure 14.1 Principal component analyses for 12 type II null-model 
corrected metrics (top row) and for the six of these based on quantitative 
branch length information (distance-based metrics, bottom row). Labels 
correspond to those in Table 14.2 and the ordering in lists corresponds to 
the factor score for the relevant metric on PC2. The percentage variance 
accounted for by each axis is shown in the axis label. 


143 Prospectus 

14.3.1 Phylogenetic diversity in conservation 

To the extent that assigning individual species dis¬ 
tinctness scores is itself of value in conservation 
biology, researchers and practitioners should con¬ 
tinue to find type I metrics useful. The sums across 















PA, dist PA, nodes AW, dist AW, nodes 


Figure 14.2 Sensitivity of each type II metric to over-dispersed or clustered phylogenetic community assembly processes. The y-axis shows the mean ±1 
standard deviation for each type II metric, expressed as the number of standard deviations from a null-model mean, in 2000 focal species sets. Results are 
shown for community assembly processes in which species abundance was not part of the process (results were very similar for both versions of the 
community assembly process). 
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species for any of the type I metrics correlate fairly 
strongly with each other and with PD. Distance- 
based trees more completely represent evolutionary 
relationships than node-based trees, and since the 
use of node-based metrics appears to be motivated 
largely by logistical considerations (i.e. the lack of 
a distance-based tree), distance-based metrics seem 
clearly preferable. Within node-based and distance- 
based metrics, redundancy is quite high (r > 0.9 
for most pairs of metrics across all tree types). As 
such, an appropriate criterion for choosing among 
metrics is their conceptual and mathematical sim¬ 
plicity. For node-based metrics, we recommend the 
taxonomic distinctness ( TD ) metric as it is more 
straightforward than either version of the species 
originality metrics (OS, WOS; see Table 14.1). For 
distance-based metrics, the originality-of-species- 
in-a-set (OSS) metric is conceptually less straight¬ 
forward, requires an ultrametric tree, is mathemat¬ 
ically much more complicated, and captures less 
PD than do the others (Redding et al. 2008), so 
we do not yet recommend its use, despite the fact 
that it shows relatively low correlations with some 
other metrics. The pendant-edge (PE) and species 
evolutionary history ( SEH ) metrics are both sim¬ 
ple conceptually and mathematically (PE more so). 
Although PE and SEH are largely redundant on 
most tree shapes, SEH contains more information 
and will, for example, identify as evolutionarily 
distinct each species in a pair of close relatives 
when the species pair itself is evolutionarily dis¬ 
tinct, whereas PE would be quite small in this case. 
It does not matter which method of apportioning 
shared branches is used in the calculation of SEH 
(r > 0.98 for all tree types). 

In some cases a conservation biologist may not be 
interested in prioritizing species, but in prioritizing 
sites based on the species they contain (e.g. For¬ 
est et al. 2007) or in understanding how ecosys¬ 
tem processes depend on the phylogenetic diver¬ 
sity, rather than only the richness, of the species 
in a community (e.g. Cadotte et al. 2008). In 
such cases, there is no need to assign individual 
scores and it is more appropriate to use PD as a 
straightforward proxy for the quantity of evolu¬ 
tionary history and therefore trait variation in a 
community. 


14.3.2 Phylogenetic diversity in 
community ecology 

When community ecologists are interested in using 
phylogenetic information to assess the degree to 
which community assembly has been non-random 
with respect to species traits or relatedness, it 
is most appropriate to use null-model corrected 
versions of type II metrics. The mean value of 
some metrics (in their raw form) is correlated with 
species richness (e.g. MNND decreases as species 
are added to a community), and even for those 
that are not (e.g. MPD) the variance among ran¬ 
domly assembled communities may well be corre¬ 
lated with species richness. The range of possible 
MPD values, for example, declines as the number of 
species increases. As such, the extent to which the 
members of a community represent non-random 
selections from a phylogeny cannot generally be 
assessed using the raw values for type II metrics. 
In terms of null models, shuffling species among 
phylogeny tips seems most straightforward, as this 
most directly randomizes the key data attribute of 
interest—the phylogenetic positions of species in 
the community. 

Perhaps the most striking result from our simu¬ 
lations was the faulty performance of node-based 
metrics on imbalanced phylogenetic trees. At best 
they are insensitive to non-random community 
assembly, and at worst they could lead to seriously 
flawed conclusions. For example, if community 
membership depends on a species being phyloge- 
netically distinct from others, a node-based metric 
may actually suggest lower rather than higher phy¬ 
logenetic diversity than expected based on a ran¬ 
dom selection of species (Fig. 14.2b, and 14.2c). Even 
for pure birth or balanced trees there is little to no 
power to detect over-dispersion using a node-based 
representation of a phylogenetic tree. As such, we 
recommend against the use of node-based trees 
in studies of phylogenetic community assembly 
and advocate great caution for researchers who 
nonetheless decide to proceed with studies of this 
kind. 

For distance-based metrics, MPD showed greater 
power than PD or NMMD to detect non-random 
community assembly. However, the probability of 
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species being added to communities in our simu¬ 
lations was a function of their mean phylogenetic 
distance to already-chosen species, so this result is 
unlikely to generalize across different kinds of non- 
random community assembly processes. In theory, 
the relative magnitudes of different metrics' devia¬ 
tion from a null model might itself reveal something 
about underlying processes (e.g. does community 
membership depend on having few very close rel¬ 
atives or on average relatedness?), but more work 
is needed to determine whether such differences 
might also arise as artefacts of the nature of under¬ 
lying data. 

A more pronounced concern than which 
distance-based metric to select is how the shape of 
the phylogenetic tree itself influences the likelihood 
of detecting non-random community assembly. 
Balanced trees make it easy to detect phylogenetic 
clustering but quite difficult to detect phylogenetic 
over-dispersion, whereas the opposite is true for 
imbalanced trees with accelerating diversification 
(Fig. 14.2). Fortunately, many phylogenetic trees 
are more imbalanced than a pure birth tree with 
decelerating diversification (Mooers and Heard 
1997; McPeek 2008), in which case clustering 
and over-dispersion have similar chances of 
being detected (Fig. 14.2b). Statistics exist for 
quantifying both balance (Heard 1992) and trends 
in diversification (Pybus & Harvey 2000); these 
statistics should be calculated by researchers 
interested in phylogenetic community structure, 
used to interpret the statistical power of the 
analysis, and reported with empirical results. Our 
most general recommendation is for researchers 
to think carefully about this issue when drawing 
conclusions, especially when making the explicit 
or implicit assumption that phylogenetic similarity 
is a proxy for trait similarity. For example, even a 
small degree of convergent evolution in a balanced 
tree (e.g. one or two species in one clade evolve 
similar phenotypes as a different clade) could 
lead to phylogenetic over-dispersion when really 
the species in a set are highly clustered in trait 
space (see also Kraft et al. (2007)). Exploring some 
simulation results using the empirical phylogeny 
of interest seems warranted in this case, despite the 
non-trivial burden this places on the researcher. 


14.3.3 Abundance vs presence-absence data 

The vast majority of empirical studies on phylo¬ 
genetic diversity have not incorporated data on 
species' abundances. In conservation, some effort 
has been made to combine phylogenetic distinct¬ 
ness and extinction risk in prioritizing species (Red¬ 
ding & Mooers, 2006; Isaac et al. 2007), which 
effectively assigns greater weight to species with 
lower abundance. Given the many criteria, in addi¬ 
tion to phylogenetic distinctness, that might enter 
into the equation for conservation prioritization we 
did not explore the explicit incorporation of abun¬ 
dance data into type I measures of phylogenetic 
diversity. 

One of the underlying premises of many stud¬ 
ies in phylogenetic community ecology is that the 
fitness of individual organisms depends on their 
similarity to other organisms in the community, 
in which case species' abundances should be an 
important consideration. To an individual organ¬ 
ism, if several species in the community are dose 
relatives, it should matter far more if those species 
are abundant than if they are rare. In our simula¬ 
tions, non-randomly assembled communities were 
created either with abundance taken into consider¬ 
ation or not, but this had virtually no influence on 
the values of different null-corrected type II met¬ 
rics or their dependence on tree shape (compar¬ 
ison not shown; Fig. 14.2 shows only the case in 
which abundances were not incorporated into com¬ 
munity assembly). This was counter to our expec¬ 
tation that abundance-weighted metrics would be 
more sensitive to abundance-weighted community 
assembly. 

The reason appears to be that the particular 
nature of the phylogeny obscures the abundance- 
weighted metrics' ability to recover the pattern. 
Consider the clustering assembly process: in our 
assembly algorithm, the first and most abundant 
species was chosen randomly from the phylogeny 
of potential species, and that species' lineage may 
or may not have close relatives. If that most abun¬ 
dant species does have close relatives, those closely 
related species are very likely to be chosen and the 
abundance-weighted metrics will detect a closely 
related community. However, in many trials the 
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most abundant species will not have close rela¬ 
tives and so weighting the selection process by 
this species abundance will have little effect. The 
second most abundant species may be the most 
abundant species' closest relative, but in the con¬ 
text of the broader phylogeny those species could 
be relatively distantly related. In this case, despite 
the abundance-weighted assembly process, the 
abundance-weighted metrics perform poorly. Thus, 
the particular location on the phylogeny of the most 
abundant species adds considerable random varia¬ 
tion to the performance of the abundance-weighted 
metrics, making them generally less sensitive 
to non-random assembly (Fig. 14.2). There was 
one exception to the general pattern: abundance- 
weighted MNND was more sensitive to abundance- 
weighted over-dispersion compared to the non¬ 
abundance-weighted measure. 

For different kinds of non-random community 
assembly, for example if abundance is the outcome 
of competitive interactions rather than determined 
based on assembly order, we might expect abun¬ 
dance data to reveal more than presence-absence 
data. Another case in which abundance seems 
likely to be important is when ecosystem func¬ 
tion depends on phylogenetic diversity (as a proxy 
for trait variation; see, for example, Cadotte et al. 
(2008)), as these ecosystem functions are performed 
by individual organisms rather than species per se. 
Comparison of abundance-weighted metrics with 
their presence-absence counterparts appears to be 
a potentially fruitful avenue of future research. 

14.4 Key points 

1. Metrics of phylogenetic diversity are used in 
conservation biology, where it is desirable to first 


assign individual species distinctness scores, and 
in community ecology, where it is of inter¬ 
est to assess the degree to which the species 
in a community represent a non-random sub¬ 
set of the species in a reference phylogenetic 
tree. 

2. The metrics used in conservation biology show 
a high degree of redundancy (i.e. strong corre¬ 
lations) with one another and with total phy¬ 
logenetic diversity (the sum of branch lengths 
connecting the species in a focal set). For a 
node-based phylogenetic tree (i.e. no informa¬ 
tion on branch lengths), the taxonomic diver¬ 
sity (TD) measure is conceptually the most 
straightforward, and for a distance-based tree, 
the pendant-edge (PE) and species evolu¬ 
tionary history ( SEH ) metrics are both con¬ 
ceptually straightforward and mathematically 
simple. 

3. In tests of non-random community assembly, 
metrics need to be standardized based on null 
models to remove inherent dependence on 
species richness. For such null-corrected metrics, 
those calculated from node-based trees can be 
seriously misleading. Redundancy is fairly high 
among those calculated from distance-based 
trees. We recommend against the use of node- 
based trees in phylogenetic community ecology. 

4. The sensitivity of phylogenetic diversity met¬ 
rics depends strongly on the shape of the 
phylogenetic tree, with phylogenetic clustering 
far more detectable in some cases (balanced 
trees) and phylogenetic over-dispersion far more 
detectable in others (e.g. imbalanced trees with 
accelerating diversification). It is important for 
empirical researchers to take the effect of their 
particular tree shape on statistical power into 
account. 



CHAPTER 15 


Genetic methods for 
biodiversity assessment 

Melanie Culver, Robert Fitak, and Hans-Werner Herrmann 


15.1 Introduction 

Biodiversity encompasses the extant diversity of 
plants, animals, fungi, and microorganisms; includ¬ 
ing the genes they contain and the community in 
which they live. There are three fundamental levels 
of biological diversity to consider: genetic diversity, 
species diversity, and community diversity (Pri- 
mack et al. 1998). Genetic diversity is the sum of the 
genetic variation within a species, species diversity 
refers to the number of species present and their 
relative abundance, and community diversity is the 
collection of organisms co-existing in their physical 
environment. Recently, genetics has become a valu¬ 
able tool for investigating biodiversity. Although all 
three levels of biodiversity can benefit from genetic 
methods, in this chapter our attention is focused 
on using these methods to assess both genetic and 
species diversity. 

Genetic diversity is a critical component of bio¬ 
diversity for two reasons: first, genetic diversity is 
the essence of all biodiversity, and second, genetic 
and biological diversity are mutually dependent 
(Lankau & Strauss 2007). In other words, the 
diversity between species (biological diversity) is 
dependent on the diversity within a species (genetic 
diversity) and vice versa. Therefore, techniques to 
monitor the long-term changes in and distribu¬ 
tion of genetic diversity are essential (Drayton & 
Primack 1996; Wilson et al. 1996b; Manley et al. 
2004). In the past, techniques to assess genetic diver¬ 
sity included restriction fragment length polymor¬ 
phism (RFLP) analyses and the comparison of DNA 
sequences from single or multiple genes. Recently, 
techniques to analyse thousands of locations across 
whole genomes have become more commonly 


used. These whole genomes and genome-wide 
scans focus on a variety of unlinked co-dominant 
genetic markers such as single nucleotide poly¬ 
morphisms (SNPs) and microsatellite DNA loci 
(STR). At the population level, genetic diversity is 
more difficult to assess. Gene flow occurs at vary¬ 
ing levels and may result in complex genetic pat¬ 
terns that can extend from completely panmictic to 
mostly isolated populations. Fairly isolated popu¬ 
lations with a unique genetic/phenotypic signature 
and evolutionary trajectory are generally referred 
to as subspecies or evolutionarily significant units 
(ESUs). 

Species diversity is a measure commonly used 
to represent biological diversity. This is because 
species are a comparatively well-defined biolog¬ 
ical category; a trait that cannot be claimed for 
above-species systematic categories (genera, fami¬ 
lies, orders, etc.) or the lower subspecies category. 
However, as biological species are, by definition, 
isolated gene pools (no exchange of genetic mate¬ 
rial), the actual 'units of evolution', the precursors 
of species, are populations (see DeQuiroz 2007 for 
a detailed discussion). These more-or-less isolated 
gene pools (populations) contain unique signatures 
that can at times only be identified using genetic 
methods. Genetic methods are also an advantage 
when taxonomists are unable to classify species 
using morphological characters, such as in species- 
rich groups like Coleoptera, or when species are 
rare, cryptic, or otherwise difficult to identify. 

Why are we interested in measuring and assess¬ 
ing biodiversity? The answer is two-fold: (i) to 
determine how many species are present and (ii) 
to estimate what ('biodiversity') is required to 
maintain the integrity of evolutionary processes. 
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In the following, we discuss the application of 
genetics in examining these key topics in biodi¬ 
versity assessments. After a short description of 
the methods currently used, we review how phy¬ 
togenies, non-invasive monitoring, DNA barcod¬ 
ing, and genomics have been employed to measure, 
maintain, and (potentially) revive biodiversity. 

15.2 Genetic methods in biodiversity 
assessment 

As a tool to investigate biological diversity, mole¬ 
cular genetics has gained high resolving power 
in the last two decades with the rapid innovation 
of biotechnological methods (Palumbi & Cipriano 
1998) and the development of computer-based tools 
to manage large amounts of genetic data. Here 
we only consider the areas of molecular genetics 
that are important for examining biological diver¬ 
sity. These areas are concentrated around methods 
which utilize the two primary sources of DNA: 
organellar DNA and nuclear DNA. 

15.2.1 Mitochondrial, chloroplast, 
and nuclear DNA 

For the purpose of this chapter genetic data consists 
of DNA sequence variation in mitochondrial DNA 
(mtDNA), chloroplast (cpDNA), or nuclear DNA 
(nDNA). This variation can be quantified in differ¬ 
ent ways depending on the question addressed and 
the level of resolution required. Generally, nuclear 
genes, which have the slowest evolutionary rate, 
reveal more ancient evolutionary resolution than 
mitochondrial or chloroplast genes, which have 
more rapid evolutionary rates. Non-coding regions 
of DNA, however, are the most rapidly evolving 
relative to nuclear, chloroplast, and mitochondrial 
genes (Table 15.1). 

DNA sequence variation is characterized as 
allelic variation for co-dominant markers such as 
SNPs and microsatellites in a diploid (or higher) 
system or as haplotypes on a section of hap¬ 
loid DNA such as found in mitochondria, chloro- 
plasts, Y-chromosomes (or other heterogametic sex 
chromosomes), or some organisms without sexual 
reproduction. The observed sequence variation for 
an organism can either be unique or shared with 


other organisms. Depending on the question asked, 
the appropriate molecular (genetic) marker must 
be examined. Usually, sets of genetic markers with 
varying degrees of resolution due to differing evo¬ 
lutionary rates are used. 

15.2.2 Genome technologies 

New high-throughput sequencing technologies 
(e.g. 454 pyrosequencing) have largely reduced 
the cost of genome-scale sequencing (for a review 
of novel sequencing technologies, see Ansorge 
(2009)), therefore analyses at the genomic level are 
now within the financial reach of typical genetic 
projects. Researchers have recently employed high- 
throughput (next- generation) sequencing to extract 
thousands of SNP markers (Vera et al. 2008) for sub¬ 
sequent microarray development, and microsatel¬ 
lite markers (Castoe et al. 2009) for population 
genetic analyses. The search for large numbers of 
genetic markers is enhanced by the potential to 
simultaneously sequence multiple 'tagged' samples 
in the same reaction (Roche Diagnostics 2009). The 
considerable increase in the number of markers will 
improve the resolution and precision of genotyping 
efforts, and thus the ability to examine biodiversity. 
Also, these genomic-level analyses are not limited 
to the taxon sequenced but, depending on phy¬ 
logenetic distance and questions addressed, may 
result in other, 'genome-enabled' taxa (Kohn et al. 
2006). 

15.3 Biodiversity assessments 

15.3.1 Phytogenies for biodiversity 
assessment using mtDNA and nuclear DNA 

One of the greatest contributions of genetics to 
wildlife conservation is through resolving taxo¬ 
nomic uncertainties. Cryptic species are those with 
few morphological differences, but identified as 
separate taxonomic units based on other types of 
data (e.g. genetic, behavioural, vocalizations). With¬ 
out genetic studies cryptic species could remain 
unknown or become extinct without ever being 
recognized. Conversely, genetic data may prompt 
the pooling of phenotypic variants to the same 
species or population, reducing the number of 



Table 15.1 Representative molecular genetic markers used for biodiversity assessments across the animal and plant kingdom. 
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recognized species. Genetic analyses are a powerful 
tool for revealing the evolutionary history and tax¬ 
onomic position of species. 

New species discovery in large mammals using 
genetic techniques is rare and has been exempli¬ 
fied twice in recent years in the Felidae and Pro- 
boscidae. Molecular evidence has shown a distinct 
clouded leopard species occurring on the islands of 
Borneo and Sumatra ( Neofelis diardi). The previously 
described species, N. nebulosa, is now considered 
to be restricted to the Asian mainland. Two stud¬ 
ies using mtDNA, microsatellite markers, nuclear 
DNA sequences, and cytogenetics revealed a deep 
historical partition between these populations, war¬ 
ranting species-level status (Buckley-Beason et al. 
2006; Wilting et al. 2007). This finding was fur¬ 
ther supported by a morphometric investigation of 
pelage features (Kitchener et al. 2006). In this case 
molecular markers led to the description of a new 
species. Another molecular study distinguished for¬ 
est and savannah elephants from each other as dif¬ 
ferent species ( Loxodonta africana, forest; Loxodonta 
cyclotis, savannah) (Roca et al. 2001). This study 
used 1732 nucleotides from four nuclear genes to 
describe these distinct species, which show as much 
divergence as Asian elephants to African elephants, 
and is an example of adaptive evolution to forest 
versus savannah habitat types. 

Genetic analyses may also suggest the opposite, 
that is, that populations or phenotypes previously 
considered to be different species are found to be 
so genetically similar that they are more accurately 
classified as one species. For example, also in mam¬ 
mals, the Felis silvestris complex (including F. sil- 
vestris, F. catus, F. libyca, and F. bieti), has low lev¬ 
els of genetic differentiation, suggesting that they 
are more appropriately classified as a single species 
(Driscoll et al. 2007). 

Because conservation efforts often focus at the 
species or subspecies level (e.g. IUCN Red List, 
US Endangered Species Act) a population's protec¬ 
tion status may be directly related to its taxonomic 
status. As evolution is a slow, gradual process, 
species assignments are not always unambiguous. 
During a speciation event, there will be a period 
where the genetic difference is deemed insufficient 
to warrant subspecies or species status. Such vari¬ 


ation may still comprise local adaptations of sig¬ 
nificant ecological and evolutionary importance, 
or differentiation caused by genetic drift, perhaps 
leading to speciation (the Felis silvestis complex may 
be an example of this). Where to draw the line for 
the level of genetic differentiation worth protecting 
is often debatable. As the resolution and population 
coverage of genetic analyses increase, such dilem¬ 
mas will become increasingly common and conser¬ 
vation decisions will have to be guided by more 
clearly defined conservation goals. 

15.3.2 Non-invasively monitoring 
for biodiversity 

Genetic monitoring programmes are often amen¬ 
able to non-invasively collected samples (e.g. hair, 
feces, urine), which is appealing for investigating 
elusive animals. For example, hairs collected by 
birds to reinforce or insulate their nests can be 
used to efficiently census rare and elusive mam¬ 
mals (Toth 2008). Genetic methods can also be used 
to analyse diets and disease, typically from faecal 
analysis. While morphological analysis of scat con¬ 
tents typically identifies to genus level, molecular 
identification can often identify individual species 
(Dee et al. in review). Animal scat is used to indicate 
the presence of a species, although visual identi¬ 
fication of scat is frequently ambiguous even for 
experts (Halfpenny 1986; Foran et al. 1997). Recent 
advances in molecular genetics, however, allow 
species to be identified reliably from both animal 
scat (Farrell et al. 2000) and bone samples (Taberlet 
& Fumagalli 1996). 

Because carnivore scat contains bone material 
from prey, and both prey and predator species can 
be identified from a single scat sample, this illus¬ 
trates carnivore scat as a source for assessing ver¬ 
tebrate biodiversity. Taberlet and Fumagalli (1996) 
described a similar method for monitoring small- 
mammal communities using DNA extracted from 
bone material removed from owl pellets. In general, 
scat from mammalian predators is more ubiquitous 
in the environment than owl pellets and may con¬ 
tain a higher variety of prey items (Farrel et al. 
2000), therefore analyzing predator scat samples 
and the bone material within them could effectively 



212 BIOLOGICAL DIVERSITY 


maximize the number of species detected while 
minimizing the number of samples. 

Molecular scatology has been applied to the field 
of biodiversity assessment through identification of 
rare species (e.g. Kohn et al. 1995 ; Palomares et al. 
2002) and diet analyses (e.g. Kvitrud et al. 2005; Par¬ 
sons et al. 2005). Although molecular scatology has 
been applied to several sympatric species simul¬ 
taneously and to the diet analysis of predators, it 
has yet to be applied at broader community lev¬ 
els such as the measurement of species richness. 
As molecular techniques become easier to use, less 
expensive, and more available to biologists and 
managers, techniques such as those described here 
should better address community-level questions in 
wildlife biology and in estimating community-level 
parameters such as species richness. 

15.3.3 DNA barcoding for biodiversity 
assessment 

DNA barcoding is a technique using a short 
sequence, usually a haploid gene fragment in mito¬ 
chondria or chloroplasts, to help classify the species 
of the organism from which the DNA originated. 
This technique has been designated as a tool to 
not only identify existing diversity from 'unidenti¬ 
fied' specimens, but also as a tool to discover new 
biodiversity (species) and therefore to gain new 
genetic information. DNA barcoding is intended 
to become a universal biodiversity tool with an 
open- access database that allows for the identifica¬ 
tion of specimens by non-taxonomists and follows 
a standardized, minimalistic procedure. Its benefits 
include protecting endangered species, sustaining 
natural resources, discovering hidden biodiversity, 
discovering biodiversity in difficult to access sce¬ 
narios (i.e, eggs, larvae, etc.), controlling agropests, 
and becoming a powerful environmental moni¬ 
toring tool. Conceptually, DNA-based biodiversity 
assessment tools have merit over morphology as 
phenotypic divergence is driven by selection and 
confounded by homoplasy, whereas genetic diver¬ 
gence is driven by stochastic processes that are 
quasi neutral (Winker 2009). In this context genetic 
data are of tremendous value. That neutral char¬ 
acters are not drivers of speciation is critical. How 


continuous is the nature of these neutral characters 
and are 'bins' appropriate? 

The promise of (single-locus) DNA barcoding 
DNA barcoding originated when Paul Herbert and 
colleagues from the University of Guelph published 
a paper on the technique (Hebert et al. 2003). They 
demonstrated that it was possible to reliably assign 
samples to the correct higher systematic groups of 
hexapods and to identify closely related lepidopter- 
ans down to the species level. The study examined 
DNA sequence differences in an approximately 650 
bp fragment of the mitochondrial cytochrome c oxi¬ 
dase I (COI) gene. Taxonomic expertise is collapsing 
and a standardized procedure is required to eval¬ 
uate the millions of species encompassed within 
biological diversity. Only a DNA-based procedure 
is financially feasible while additionally providing 
the large throughput work-flow required. Such a 
system overcomes the limitations of morphology- 
based identification and is independent of sys¬ 
tematic expertise or different states of phenotypes 
(i.e. life history stage or sex). COI was chosen 
because it is haploid, without recombination, and 
insertions/deletion mutations (indels) are rarely 
observed. Additionally, there are conserved regions 
for which universal primers that work across many 
taxa can be designed. Most importantly, the muta¬ 
tion rate of COI exhibits resolution at the species 
level. 

Single-locus barcoding has proven useful in a 
large dataset of about 100000 specimens of 3500 
morphologically defined species of moths, butter¬ 
flies, tachinid flies, and parasitoid wasps. Only a 
small percentage (<1%) could not be distinguished 
by their COI barcode (Janzen et al. 2009). The COI 
barcode did, however, assist in discovery of many 
cryptic species, later confirmed by morphological 
features previously assumed to be within species 
variation boundaries. 

Extensive sampling over large geographic areas 
covering whole taxonomic groups show that the 
observed power of species differentiation is not a 
sampling artefact. In a large study of 353 species 
of central Asian butterflies, allopatric species 
did not have less sequence diversity than sym¬ 
patric species (Lukhtanov et al. 2009). The large 
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geographical coverage did substantially increase 
intraspecific variation, thus reducing the barcoding 
gap (Fig. 15.1). However, this did not impede the 
use of barcoding for species identification, which 
had a success rate of >90%. 

In well-studied animal groups, such as birds, 
closely related sister species could be identified, and 
later confirmed by a coalescent test, with the single¬ 
locus COI barcoding approach (Tavares & Baker 
2008). However, a universal distance-based 'cutoff' 
approach was invalid based on variation of times to 
common ancestry among sister species and in rates 
of evolution. 

COI barcodes work well in birds and fishes, with 
birds providing a more defined species pattern than 
fishes (Ward 2009). The latter is caused by species 
diversity being less understood in fishes, includ¬ 
ing numerous cryptic species. DNA barcoding is 
clearly superior to morphological approaches to 
define species because, unlike barcoding, morpho¬ 
logical approaches are often distorted by homo- 
plasy (Packer et al. 2009). 

Plant mitochondria, however, demonstrate very 
slow mutation rates and cannot be used for DNA 


barcoding. Another organelle found in plants, 
chloroplasts, exhibit slower evolutionary rates in 
their genomes than mitochondrial genomes in ani¬ 
mals. Despite slower evolutionary rates, portions 
of the chloroplast matK gene and trnH-psbA spacer 
are useful barcoding genes in South African and 
Costa Rican plants (Lahaye et al. 2008). Notably, 
matK exhibited a barcoding gap between intra- 
and interspecific variants (with some overlap, see 
Figure 15.1) and has enough resolution to iden¬ 
tify more than 1000 Mesoamerican orchids. The 
identified orchids included cryptic species, some 
of which are listed as endangered in the Conven¬ 
tion on International Trade of Endangered Species 
(CITES). Adding trnH-psbA sequence variation to 
the matK data only slightly improves the power of 
the analysis. 

Recently, the CBOL (Consortium for the Barcod¬ 
ing of Life) Plant Working Group (2009) announced 
standardized DNA barcoding loci for plants using 
the chloroplast genes matK and rbcL. Despite the 
use of two loci, the species discrimination was only 
72% in a sample set of 259 specimens comprising 
95 species from 34 genera. This clearly performs 



Figure 1S.1 Overlap between intraspecific (grey) and interspecific (black) genetic variability for congeneric COI sequences in Diptera (adapted from 
Meier et al. 2006). 'Barcoding gap' indicates the approximate position and range of an expected barcoding gap. 
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worse than the COI locus in animals. Kress et al. 
(2009) used a three-locus approach (matK + r be L + 
trnH - psb A) and the BLAST algorithm to correctly 
identify >98% of the 296 species of woody plants 
in their study plot in Panama, but three genes can 
be expensive and time-consuming, thus challenging 
some of the basic barcoding objectives. 

Ecological applications have shown that bar¬ 
coding can identify the organisms contained in 
wild animal diets through sequencing the stomach 
extracts of species such as krill (Passmore et al. 
2006). Another projected advantage of the 'Barcode 
of Life' project was its ability to account for the 
diversity of unicellular life. As of yet, the barcod¬ 
ing of eukaryotic microorganisms has been poorly 
studied, but with emerging genomic technologies 
there is more promise in this field (Frezal and 
Leblois 2008). 

The pitfalls of (single-locus) barcoding 
Although COI is a good barcoding gene in a num¬ 
ber of animal groups (Hebert et al. 2003), there is 
evidence that COI barcoding does not always per¬ 
form as well in amphibians, for example (Vences 
et al. 2005a). COI divergence in amphibians is gen¬ 
erally high (7-14% within a species with a maxi¬ 
mum of 18%) and overlaps with interspecific diver¬ 
gence rates. This introduces high variation at the 
priming sites, making the successful amplification 
of COI fragments a challenge. The observed high 
divergence of COI in amphibians is not caused by 
excessive substitution rates but by deep mitochon¬ 
drial divergence and is intensified by frequently 
observed introgression and incomplete lineage sort¬ 
ing. This leads to common discordance between 
geographical signatures. High mitochondrial diver¬ 
gence within species has also been described for 
pulmonate land snails (Thomaz et al. 1996). Vences 
et al. (2005b) recommend the use of the mitochon¬ 
drial 16S rRNA gene as the barcoding locus for 
amphibians (and possibly other vertebrates). They 
show that 16S rRNA primer sites have fewer sub¬ 
stitutions and amplify DNA fragments more reli¬ 
ably than COI primers. Additionally, 16S rRNA is 
able to recover major monophyletic clades of ver¬ 
tebrates, distinguish between sibling species, and 
characterize individual populations whereas COI 
cannot. In land snails, 16S rRNA performs better 


than COI (Steinke et al. 2005). Jansen et al. (2009) 
were unable to use COI barcodes to delimitate 
Nearctic ant species and found a large overlap of 
intra- and inter-specific distances in Palaearctic ant 
species. Davison et al. (2009) emphasized that it is 
very important to have good baseline data in order 
to assign species. If that baseline is not available, 
species assignment will be incorrect. In Porifera and 
Anthozoa, mtDNA evolution is much slower than 
in other metazoans (Huang et al. 2008). In these 
groups, it is impossible to use COI to distinguish 
between species because several well-characterized 
species share the same barcode. 

Sequence units (genes) that exhibit good barcod¬ 
ing properties can vary in size and location between 
taxa. This decreases the usefulness of one single 
barcoding locus. Roe and Sperling (2007) investi¬ 
gated the COI and II loci and assessed their useful¬ 
ness for barcoding efforts. For both subunits they 
included the entire 2.3 Kb region and used a 600 
bp sliding window to analyse patterns of evolution. 
They found that locations of maximum divergence 
between taxa were highly variable and not con¬ 
fined to the DNA barcoding region of COI. Like 
others before them they found a major overlap of 
divergence within and between species. They also 
found areas of mutation saturation indicated by 
transition/transversion indices with only limited 
association to levels of DNA divergence. They con¬ 
cluded that no single 600 bp region of the COI and 
II genes is optimally informative across taxa and 
recommended pilot studies in the taxonomic areas 
of interest to identify the most informative regions. 
Additionally, they recommended analyzing several 
specimens from different populations to increase 
taxon sampling. 

For similar reasons criticism can also be directed 
towards the analysis of barcoding data as imple¬ 
mented in the Barcode of Life Datasystems (BOLD, 
www.barcodinglife.org). This database is charac¬ 
terized by a number of improvements to other 
genetic databases, i.e. the requirement of trace data 
submission, quality control of trace data, deposi¬ 
tion of voucher specimens from which genetic data 
originated, photo vouchers, etc. (Ratnasingham & 
Hebert 2007). The most common criticism is the 
use of distance methods to identify and delineate 
species. In a paper analyzing 1333 COI sequences 
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for 449 species of Diptera, Meier et al. (2006) found 
a relatively low species identification success rate 
(<70%) with tree-based identification criteria. They 
illustrated that pairwise distances or other simi¬ 
larity measures can lead to logically inconsistent 
results and that the often-applied neighbour-joining 
(NJ) clustering methods are insufficient to resolve 
data ambiguity as NJ methods generally result in 
only one tree and disregard other potential trees. In 
their dataset they found a wide overlap of intra- and 
interspecific COI sequences and found the species' 
COI divergence threshold (proposed at 3%) prob¬ 
lematic. However, DNA barcoding can be success¬ 
ful in taxa where taxon sampling is nearly complete 
(i.e. Aedes and Anopheles). 

Can single-gene/single-threshold criteria for 
species discovery work theoretically for biological 
species with reproductive isolation? To answer this 
question Hickerson et al. (2006) employed recip¬ 
rocal monophyly in tree-based analyses and a for¬ 
merly proposed 10 times rule (Hebert et al. 2004), 
meaning that average pairwise genetic differences 
between two species must be at least 10 times 
greater than the differences observed within each 
species. Theoretically, these criteria can identify 
reproductively isolated species with an error rate 
of less than 10%, but only if the isolation happened 
at least 4 million generations ago. For more recent 
species, this method is useless. 

To compute barcoding gaps, the intra- and inter¬ 
specific sequence variability for congeneric COI 
sequences and mean interspecific genetic distances 
are commonly used. Instead of mean interspecific 
genetic distance, the smallest distance should be 
used (Meier et al. 2008). This is necessary because 
the closest relatives within a genus need to be sep¬ 
arated. This approach will render species identi¬ 
fication more difficult as the number of species 
sampled per genus increases. Non-distance-based 
'character' methods can help to circumvent this 
problem. Instead of using distance-based meth¬ 
ods to delimitate species from single-locus mtDNA 
sequences, coalescence model-based methods may 
be more appropriate. In a dataset encompassing 
1614 individuals of insects from 12 families in four 
orders such a model was able to delineate 370 
putative species, which were confirmed by nuclear 
DNA variation and/or morphological characters 


(Monaghan et al. 2009). The taxa delineated rep¬ 
resent evolutionary coherent entities rather than 
similarity-based operational units. 

Nuclear mitochondrial pseudogenes (numts) 
can introduce ambiguity into DNA barcoding. 
Numts are easily coamplified with orthologous 
mtDNA, and barcoding with COI led to over¬ 
estimating the number of unique species in 
grasshoppers and crayfish (Song et al. 2008). Even 
after rigorous quality control for numts, such as 
screening for indels, in-frame stop codons, and 
nucleotide composition, not all numts were iden¬ 
tified. Such observations call for rigorous quality 
control in barcoding studies, including a number 
of approaches such as the use of mitochondria-rich 
tissue, the amplification of long DNA fragments, 
or the use of multiple loci not physically located 
near each other in the genome. Because no simple 
or rapid adjustment has been found, numts pose a 
serious challenge to DNA barcoding and will have 
to be considered in order to reduce incorrect molec¬ 
ular inferences. 

That universal primers frequently do not per¬ 
form well in metazoan groups, leading to the usage 
of degenerate primers combined with very low 
annealing temperatures, which decreases the strin¬ 
gency of DNA-fragment amplification, is another 
serious shortcoming in the light of incorrect molec¬ 
ular inference and fast, inexpensive DNA barcoding 
(Siddall et al. 2009). The loss of specificity can lead 
to amplification of contaminants, which then find 
their way into databases and confound the barcod¬ 
ing effort. 

Beyond (single-locus) DNA barcoding 
Frezal and Leblois (2008) provide an excellent 
review of the DNA barcoding field. They discuss 
the progress made and clearly point out the lim¬ 
itations of barcoding. As discussed earlier, these 
limitations include sampling shortage across taxa 
('sampling gaps'), nuclear copies of mitochondrial 
genes ('numts'), mtDNA based assignments, and 
the false assignment of sequences using distance 
methods. Additionally, there are inherent problems 
associated with describing new species from only a 
~ 600 bp fragment of mtDNA, without having an 
actual specimen 'in hand.' For example, mtDNA 
is not an adequate source of species-defining DNA 
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in cases of reduced populations size, introgression, 
inconsistent mutation rate, or heteroplasmy. Many 
of these conditions may be a common occurrence in 
populations being sampled for biodiversity (Rubi- 
noff et al. 2006). Because it is unlikely that the 
single-locus approach can be used across all eukary¬ 
otic systems, the idea of a multi-locus barcoding 
approach has been proposed. For example, in plants 
one could use an 'identifier' to provide for cor¬ 
rect assignments to higher taxonomic categories 
and varying 'identifiers' for species-level resolu¬ 
tion. While the value of the barcoding technique is 
less controversial when applied to the identification 
of species previously defined by other criteria, the 
application of DNA barcodes is still a contentious 
issue for defining novel biodiversity (Blaxter et al. 
2005; Rubinoff et al. 2006; Frezal & Leblois 2008). 

Parallel to the DNA barcoding initiative another 
large effort to assess biodiversity in the sea was 
initiated by Craig Venter (Venter et al. 2004; Rusch 
et al. 2007). His group chose a different approach, 
both in targeted organisms and technique. They col¬ 
lected marine microbial populations in water sam¬ 
ples and used shotgun cloning to sequence libraries. 
In their first attempt. Venter et al. were able to 
identify 1800 genomic species based on sequence 
relatedness, including 148 previously unknown 
bacterial phylotypes. It is important to realize that 
the methods used here are expensive, not standard¬ 
ized, and far from minimalistic. Venter et al. pre¬ 
dicted that the economics of sequencing will change 
dramatically and make large-scale genetic biodiver¬ 
sity assessments feasible. 

Single-locus DNA barcoding is not a silver bullet 
to genetically measure and assess biodiversity. It is 
successful in a few well- studied groups but is char¬ 
acterized by numerous limitations ranging from 
primers to analyses and is not applicable in some 
organism groups. The genetic barcoding system 
of the future should preferably include numerous 
independent (unlinked) loci with a genome-wide 
distribution. Additionally, markers should show 
different levels of resolution. Venter et al. (2004) 
came closest to this approach but were restricted by 
the inhibitive cost of shotgun cloning and Sanger 
sequencing. Next-generation sequencing technol¬ 
ogy will enable genome-wide barcoding in the near 
future. 


15.3.4 Genome technologies for biodiversity 
assessment 

The increased sequencing capacity offered by next- 
generation technologies will continue to expand 
our knowledge of biodiversity. Although most com¬ 
monly used in the description of microbial biodi¬ 
versity (see Chapter 16), genome technologies will 
facilitate future research in three key areas of bio¬ 
diversity assessment: (i) the measurement of biodi¬ 
versity, (ii) the maintenance of biodiversity, and (iii) 
the revival of biodiversity. 

The ability to identify species from DNA 
sequences is critical for measuring biodiversity. 
Whether using DNA barcodes or other loci, next- 
generation sequencing provides a diagnostic tool 
for rapid species detection. In a single reaction, 
next-generation technologies can analyse over 1 
million sequences (www.454.com). When screen¬ 
ing samples containing multiple species, individual 
sequences are compared with existing sequences to 
identify the species. For example, Valentini et al. 
(2009) and Deagle et al. (2009) used this approach 
to examine the biodiversity in the diet of herbi¬ 
vores and fur seals, respectively. Next-generation 
sequencing is also valuable for characterizing 
ancient biodiversity by sequencing DNA fragments 
from historical specimens (Millar et al. 2008). Using 
pyrosequencing. Blow et al. (2008) were able to 
classify an unidentified 69000- year-old tooth as 
belonging to an ancestor of steppe bison. While the 
identification of species is important for measuring 
past and current biodiversity, the analysis of com¬ 
plete genomes and genetic variation will benefit the 
maintenance of existing biodiversity. 

The maintenance, or conservation, of biodiver¬ 
sity is principally concerned with endangered or 
declining species. Recently, one of the first com¬ 
plete genome sequences for an endangered species 
was released, that of the giant panda (Ailuropoda 
melanoleuca-, Ruiqiang et al. 2010). The authors 
were the first to demonstrate the de novo assem¬ 
bly of a eukaryotic genome strictly from next- 
generation sequencing methods. In addition to the 
assessment of several unique evolutionary traits, 
the authors identified >2.7million SNPs. SNPs, 
or other genome-wide sets of genetic markers, are 
valuable for conserving biodiversity. For instance. 
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SNPs have already been used to suggest a 'marker- 
assisted' captive breeding strategy for the European 
and American bison (Pertoldi et al. 2009). Marker- 
assisted breeding strategies will reduce inbreed¬ 
ing, thus decreasing the expression of lethal alleles, 
such as chondrodystrophy in the California condor 
(Romanov et al. 2009). Genomics will also facili¬ 
tate studies of adaptive differences between pop¬ 
ulations. Compared to neutral markers, like most 
SNPs and microsatellites, adaptive loci can pro¬ 
vide better estimates of evolutionary differentiation 
resulting from ecological pressures (Gebremedhin 
et al. 2009). Genomic studies of adaptive loci in 
amphibians have been predicted to shed light on 
the variation responsible for certain populations' 
susceptibility to causes of decline (Storfer et al. 
2009). In the future, projects such as the initiative to 
sequence 10000 vertebrate genomes (Genome 10K 
Community of Scientists; Hayden 2009) will not 
only provide more neutral and adaptive markers, 
but advance our understanding of the mechanisms 
underlying the evolution of biodiversity. 

The final, and most controversial, application 
of genome technologies to studies of biodiver¬ 
sity is the revival of extinct species. The resurrec¬ 
tion of extinct species became a reality when the 
Pyrenean ibex (Capra pyrenaica pyrenaica ), extinct 
as of 2000, became un-extinct for several minutes 
in 2009 before the cloned newborn died of lung 
failure (Folch et al. 2009). Albeit nuclear-transfer 
techniques were performed to clone the ibex; it 
may soon be possible to regenerate biodiversity 
simply with a genome sequence. Nicholls (2009) 
pointed out the inherent difficulties in resurrect¬ 
ing the extinct woolly mammoth from a genome 
sequence, although its feasibility was not entirely 
ruled out. The revival of extinct biodiversity raises 
many contentious biological and ethical concerns, 
but its impact on biological diversity cannot be 
ignored (for a review see Pina-Alguilar et al. 2009). 


15.4 Prospectus 

Biodiversity assessments from genetic methods in 
the near future will certainly continue to rely on 
technologies such as DNA barcoding and whole- 
genome sequencing, or even a combination of the 


two where DNA barcoding takes on a more whole- 
genome character. 

For DNA barcoding the silver lining on the not 
too far away horizon is next-generation sequencing, 
with its precipitous drop in sequencing cost. This 
cost reduction equalled four orders of magnitude 
for the last decade and leads to proposals like the 
10 000 genome project in which scientists are aiming 
for 10000 complete genomes across representative 
vertebrate species (Genome 10K Community of Sci¬ 
entists 2009). Proposals like this one would have 
been science fiction a few years ago but are now 
in close reach, only requiring one more order of 
magnitude drop in cost before realization. Now it 
is easy to imagine DNA barcodes that are based on 
SNPs with a genome-wide coverage. 

The availability of whole genomes represents a 
quantum leap in genetic biodiversity assessment, 
as did the complete human genome for med¬ 
ical research. As these whole-genome sequences 
become available they will advance our under¬ 
standing of the genetic variation that has been so 
elusive for the past several decades, that of adap¬ 
tive variation. With this increased understanding of 
adaptive variation will come a better understand¬ 
ing of the evolutionary process that generated the 
existing biodiversity. The question of whether bio¬ 
diversity should be measured in 'species' or 'geno¬ 
types' becomes philosophical. In reality we will 
work with genotypic data as it will be readily avail¬ 
able. Finally, the future holds the controversy of 
whether or not to use genomic technologies to res¬ 
urrect extinct biodiversity. 


15.5 Key points 

1. The genetic diversity of individuals and pop¬ 
ulations is an important component of species 
diversity, as it is important to the diversity of the 
biological community. 

2. Genetic diversity is measured using a variety of 
molecular genetic markers, each with a different 
evolutionary rate giving them different resolving 
power. Markers with the appropriate resolution 
are selected for different biodiversity questions. 

3. Genetic analyses are a powerful tool for reveal¬ 
ing the evolutionary history and taxonomic 
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position of species, particularly the cryptic ones 
that cannot be detected by other means. 

4. Non-invasive methods to assess biodiversity in 
the wild become more important as newer tech¬ 
nologies allow us to retrieve increasingly more 
information from less sample. As molecular 
techniques become easier to use and less expen¬ 
sive, they will become more available to biolo¬ 
gists and managers. 

5. The value of barcoding has been widely accepted 
when applied to the identification of species 
previously defined by other criteria, but is 


controversial when used for novel species iden¬ 
tification. However, barcoding is an emerging 
technology that shows promise in the field of 
genetic biodiversity assessment, especially once 
genome-wide barcodes can be implemented. 

6. The developing genomic technologies are impor¬ 
tant for measuring past and current biodiver¬ 
sity, as well as measuring existing biodiversity 
of neutral and adaptive variation. Quite possi¬ 
bly, in the future it might be possible to regen¬ 
erate extinct biodiversity by generating an entire 
genome sequence. 



PART V 
Applications 






CHAPTER 16 


Microbial diversity and ecology 

Lise Ovreas and Thomas P. Curtis 


16.1 Introduction 

Microbial diversity is surely the outermost fron¬ 
tier of the exploration of diversity. The scale 
and inscrutability of the microbial world have 
deceived some into thinking there is nothing 
very interesting there! 

... '7 do not discuss the biodiversity of microorgan¬ 
isms at all. From the point of view of diversity, they 
are probably the most poorly known of taxa. Perhaps 
their diversity shows many patterns but I am unaware 
of them." 

Michael Rosenzweig 'Species Diversity in 
Space and Time' Rosenzweig (1995) 

Nothing could be further from the truth. However, 
the exploration of this frontier presents the investi¬ 
gator with multiple challenges, which are, little by 
little, being considered, constrained, and obviated. 
Although great progress has been made there seems 
little doubt that the golden age of microbial diver¬ 
sity lies before us and not behind. 

Microorganisms are not only the oldest and most 
diverse living organisms on the planet, but they also 
represent the vast majority of the Earth's biodiver¬ 
sity and are the driving forces in major biogeochem¬ 
ical processes. For approximately 4.5 billion years 
the Earth has been evolving from an unproductive 
volcanic landscape into the lively globe full of life 
that it is today. The first forms of life, prokaryotes, 
have been found in fossils from 3.5 billion years 
ago, and evidence for photosynthesis and earliest 
life have been reported back at 3.8 billion years 
in the Isua Greenstone Belt in Greenland (Rosing 
& Frei 2004). For more than 2 billion years life, 
and evolution, on the planet was solely prokaryotic. 
Microorganisms began to evolve with the chang¬ 
ing environmental conditions of the planet. New 


environmental conditions created by microorgan¬ 
isms created new selective pressures driving the 
evolutionary process and creating, over millen¬ 
nia, new orders, families, genera, and species. Yet 
we know very little about microbial diversity—we 
have only the slenderest notion of its extent and 
know still less about how it is generated and main¬ 
tained. The assessment of biodiversity is one of the 
most challenging and fascinating aspects of micro¬ 
biology. The scale of our ignorance is reflected by 
the range of estimates of the number of bacterial 
species that exist on Earth today, varying between 
10 4 and 10 9 . Our lack of enlightenment is partly due 
to the isolation of microbial diversity studies from 
the general study of biodiversity, but is also perhaps 
because of microbial diversity's shear inscrutability: 
the absence of a species concept, the impossibility 
of morphological examination, and the inadequacy 
of culture-based methods has forced microbial ecol¬ 
ogists into the arms of the molecular biologists, 
whose methods have traditionally been applied to a 
narrow range of conserved genes. The tools of mole¬ 
cular ecology are only partially suited to the scale of 
the microbial communities microbial world and can 
offer an at best limited, and at worst totally false, 
view of the microbial world. Although great strides 
are being made by focusing on the occurrence and 
diversity of a handful of genes (typically 16S rRNA) 
there is so much more to the microbial world than 
this molecule. 

Times are changing, however, and the rational 
structured and adequate use of molecular tools 
to explore ecology is foreseeable, and in certain 
instances possible. The proper determination of 
microbial diversity is a key part of that change. We 
will in this chapter lay out the bare bones of the 
conceptual foundations, laboratory techniques, and 
mathematical methods required, but this is only a 
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Figure 16.1 The tree of life shows how much greater the diversity of the microbial world is compared to the diversity of larger organisms. The Archaea 
and the Bacteria in particular account for the bulk of the tree. 


beginning. This is science not accountancy. It would 
be a mistake to collapse all microbial ecology into a 
number or even a few numbers for they could not 
possibly capture the wonder and complexity of the 
microbial world. 

Our most primitive scratchings at the surface of 
the problem have amazed us. It is not only the 
great diversity that is surprising (Fig. 16.1) but also 
the revealing of completely new forms of life. Who 
knows what treasures future generations will find 
when they explore systematically, and in earnest, 
equipped with a sound grasp of the microbial diver¬ 
sity around them. 

The focus in this chapter will be on prokaryotic 
microorganisms. Their importance is noticeable as 
they represent by far the largest phylogenetic diver¬ 
sity on Earth, are vehicles for all the biogeochemical 
cycles, and regulate nutrient cycling and nutrient 
pools for eukaryotes. 

16.2 The diversity concept 

The term 'microbial diversity' describes differ¬ 
ent aspects of complexity and variability within 


microbial populations and communities. This 
includes genetic variability within taxa (species), 
variability in community composition, and com¬ 
plexity of interactions, trophic levels, and num¬ 
ber of guilds. Traditionally biodiversity is based 
on the 'species' as a unit. In microbial ecol¬ 
ogy the species concept is useless as the species 
concept for bacteria is obscure. In microbiology 
the term 'diversity' is used to describe qual¬ 
itative variation among microorganisms as we 
draw on more abstract and arbitrary measures of 
evolution. 

Because of difficulties with classification, numer¬ 
ical taxonomy (using mathematical algorithms 
like cluster analysis) is often used for bacter¬ 
ial systematics. With this method the differences 
between bacterial isolates are calculated and are 
then clustered into biotypes. A biotype is an oper¬ 
ational taxonomic unit (OTU) that can be used 
instead of a species to describe and compare 
microbial populations and communities. Diversity 
may also be represented as phylogenetic trees, 
or appreciated from the number of functional 
groups. 
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16.3 Phytogeny 

The measurement of microbial diversity is rooted in 
evolutionary relationships as comparative analyses 
of specific genes can be used to infer the evolution¬ 
ary pathways of organisms (see Chapter 14). This 
approach, termed molecular phylogeny, reveals 
the history of organismal lineages as they change 
through time, and is based on the number and loca¬ 
tion of differences in the nucleotide sequences of 
homologous genes from different organisms. The 
prime goal of molecular phylogeny is to establish 
a framework into which organisms are grouped 
according to their evolution, and show that all life 
is genetically connected via a vast phylogenetic 
tree (see Chapter 15). It is generally assumed that 
early life forms originated from a gene pool repre¬ 
senting common evolutionary ancestry. Classifica¬ 
tion of microorganisms should be explicit and fol¬ 
low accepted nomenclature guidelines. Individuals 
showing high degrees of similarity are grouped, 
defining the lowest unit of classification. In biology 
the basic taxonomic unit is the species, however a 
problem herein is that the classification of prokary¬ 
otic microorganisms has been rather ambivalent 
and the species concept ambiguous. Still, the tree 
of life provides an accurate framework to direct 
research in all biological sub-disciplines, and repre¬ 
sents an ideal model for the organization of biolog¬ 
ical knowledge at different levels. 

16.4 rRNA as an evolutionary 
chronometer 

There are several evolutionarily conserved genes 
that are present in most organisms and certain cri¬ 
teria are important for a molecule to be regarded 
as an evolutionary chronometer. In order to deter¬ 
mine the evolutionary relationship it is essential 
that the gene chosen will (i) be universally dis¬ 
tributed between organisms, (ii) have a defined 
structure and function, (iii) be present in large 
numbers per cell, (iv) keep stable with time, and 
(v) have a sequence that changes at a rate cor¬ 
responding with the evolutionary distance being 
measured (Zuckerka & Pauling 1965; Olsen et al. 
1986). The most favoured, and thus most impor¬ 
tant, gene specifies the sequence for the ribosomal 
RNA molecule and it encodes a specific component 


of the protein-synthesising ribosomes. By compar¬ 
ing ribosomal RNA (rRNA) sequences Carl Woese 
established a molecular sequence based phyloge¬ 
netic tree that could be used to relate all organisms 
and reconstruct the history of life (Woese 1987). 
The rRNA molecule evolves at a particular rate 
and some parts of the molecule are more impor¬ 
tant to its function than others. As a result the 
functional regions of the molecule are almost iden¬ 
tical in sequence from the smallest bacterium to 
the largest animal. rRNA has attracted considerable 
interest and is today the primary tool for measuring 
diversity and the proxy for the species concept for 
microorganisms. When rRNA sequences from dif¬ 
ferent organisms are compared to each other, the 
number of changes in the sequence can be used to 
create a phylogenetic tree. 

16.5 Methods for assessing diversity 

The insight that sequences rather than cultures 
could be used to infer differences between organ¬ 
isms has been liberation for microbial ecologists. 
Unlike macroorganisms, most microbes cannot 
be taxonomically and metabolically categorized 
according to directly visible phenotypes. Tradition¬ 
ally, to identify an organism it was necessary to 
obtain a pure culture and to conduct a battery of 
tests to determine its core identities and physio¬ 
logical properties. This is not only very laborious 
but a highly partial process as most species can¬ 
not (or have not) been cultivated in the labora¬ 
tory (Pace 1997). Culturing conditions select for a 
distinct subpopulation of the bacteria present in 
the given environment (Staley & Konopka 1985) 
Cultivation-dependent methods therefore only give 
information about those few organisms able to 
grow under the given conditions. This is a highly 
biased approach and only a limited fraction of 
microorganisms have, or ever will be, fully char¬ 
acterized in this manner (Amann et al. 1995). This 
challenge in cultivation has skewed our knowledge 
of microbial diversity and limited our understand¬ 
ing of the microbial world. 

16.5.1 PCR-based methods 

The most common approach for assessing micro¬ 
bial diversity is to use polymerase chain reaction 
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(PCR) to amplify 16S rRNA genes (rDNA) from the 
community (Pace et al. 1986) The amplified genes 
can then be cloned, into £. coli, and identified by 
sequencing using classical sequencing technologies 
and then compared, by alignments (Hugenholtz & 
Pace 1996) to other taxa. The advantage of this 
method is that a large fragment of the gene can be 
unequivocally analysed. The disadvantage is that 
the cloning and sequencing is time consuming and 
labour intensive. Although clone libraries of hun¬ 
dreds or more are technically feasible they are sel¬ 
dom undertaken for more than a site or two in most 
studies and most laboratories can, or could, only 
manage tens of clones per sample if more than a 
handful of samples were to be analysed. 

Consequently, community fingerprinting tech¬ 
niques have gained favour. These methods are 
quick, simple and cheap, and as such can be 
used to study many sites in space and time. 
They include denaturant gradient gel electrophore¬ 
sis (DGGE; Muyzer et al. 1993), terminal restric¬ 
tion fragment length polymorphism (T-RFLP; Liu 
et al. 1997), single stand conformation polymor¬ 
phism (SSCP; Schwieger & Tebbe 1998), ribosomal 
intergenic spacer region (RISA; Ranjard et al. 2000), 
and automated RISA (ARISA; Cardinale et al. 2004). 
All these methods provide information about the 
numerically dominant community members based 
on sequence differences in the genes of interest. 
Differences in sequences are detected manually by 
eye or automatically by machine and the sequence 
variations are governed by both the detection limit 
and the resolution level of the method. Thus only 
the most abundant taxa will be characterized by 
these methods (see flow diagram of the meth¬ 
ods, Fig. 16.2, and detection limits and thresholds. 
Table 16.1). Although all such methods are flawed 
in some respect, they may nevertheless be tenta¬ 
tively used. 

These intermediate resolution methods resolve 
the diversity of microbial communities and are 
based on differences in conserved genes like 16S 
rRNA genes and functional genes. Such molec¬ 
ular methods give qualitative information about 
the organisms present and have revolutionized our 
understanding of the diversity and distribution of 
bacteria, archaea and microbial eukaryotes. The 
results obtained using these methods has demon¬ 


strated that the analysis of microbial model sys¬ 
tems can provide important insights into ecosystem 
function and stability. However, the terminology 
and metrics used in macroecology must be applied 
with great caution because the methods available to 
characterize microbial communities are still in their 
infancy. 

The ability of microbial ecologists to quantify 
diversity and test hypotheses regarding patterns 
and processes in microbial communities depends 
to a great extent on their ability to characterize 
the diversity and distribution of microbes in a 
wide range of habitats. Typically, investigators are 
interested in the relative diversities of communities 
across a gradient of stress, disturbance and other 
biotic or abiotic differences. It is difficult with cur¬ 
rent techniques to study the true diversity since we 
do not know what is present and we have no way 
of determining the accuracy of our extraction or 
detection methods. Current studies on the diversity 
of prokaryotes often employ methods based on the 
analyses of 16S rRNA genes, as these allow inves¬ 
tigators to detect and quantify phylotypes that are 
inaccessible by culture in the laboratory. In this way 
a more comprehensive assessment of diversity can 
be obtained than was previously possible, leading 
to the improved understanding of the extraordinary 
richness of prokaryotic biodiversity (Woese 1987; 
DeLong & Pace 2001; Wellington et al. 2003; Orem- 
land et al. 2005; Nakatsu 2007). Although progress 
has been made towards an understanding of micro¬ 
bial diversity, it has been faltering and in many 
ways it is flawed. Sample sizes are still dictated by 
what is feasible, not what is required. Attempts to 
mathematically extrapolate data are often inconsis¬ 
tent, with the inherent limitations of sample sizes 
ignored and the crucial difference between a sample 
and the environment typically unacknowledged. 
The application of these contemporary methods has 
demonstrated that the extent of prokaryotic diver¬ 
sity in most habitats is almost beyond our under¬ 
standing, with complex habitats containing an esti¬ 
mated 10 4 -10 6 taxa in a single gram (Dykhuizen 
1998; Torsvik et al. 1998; 0vreas et al. 2003; Gans 
et al. 2005). The Earth's biosphere contains more 
than 10 30 individuals (Whitman et al. 1998) and an 
untold number of species. The key to the efforts to 
characterize this extraordinary diversity is that the 
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Figure 16.2 The array of tools available to the microbial ecologist exploring 
See Table 16.1 for more details on the performance of each method. 

observed and inferred rank abundance distribution 
for most communities show a long tail of numeri¬ 
cally minor species or phylotypes (Mac Arthur 1960; 
Curtis & Sloan 2004). This means that most native 
microbial communities are dominated by a small 
number of species whereas the vast majority of pop¬ 
ulations are rather uncommon. 

16.5.2 Pyrosequencing 

Novel sequencing technologies such as massively 
parallel pyrosequencing have brought sequenc¬ 
ing to yet another level, with a throughput of 
up to 400 million high-quality bases in 10 hours. 
This is the sequencing scale we require to explore 
the extent of microbial diversity. A few years 
ago 'pyrosequencing' (http://www.454.com) was 
developed as a remarkably inexpensive approach 


microbial diversity. The relationships between them are large and increasing. 


to shotgun style studies. The advantage of this 
pyrosequencing is its significantly lower price 
and the higher number of sequences that can 
be obtained quickly. Recent dramatic advances in 
DNA sequencing technology including the 454 
Life Science pyrosequencing techniques, have made 
metagenomic analysis of environmental samples 
an achievable reality. Because of this, an expo¬ 
nential increase in sequencing data has been gen¬ 
erated. A traditional Sanger sequencer produces 
(per run) approximately 70Kbp, whereas a 454 
pyrosequencer produces 100 Mbp. The bottleneck 
will therefore rapidly become computational. In 
order to make sense of these massive data sets, 
mathematical modelling will assume a central role 
in microbial ecology. These newer methods are 
prone to sequencing error and this may have dis¬ 
torted the number of singletons in early reports of 






Table 16.1 Summary of the available technology for determining diversity in microbial samples. 



simultaneously Only most abundant taxa 

One band can represent 
more than one species 
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diversity using this technology. However, this is not 
an insuperable problem (Quince et al. 2009) and 
seems unlikely to affect the scale of estimates of 
diversity using this technology (0vreas et al., in 
preparation). 

The advent of 454 sequencing could finally place 
the rational exploration of microbial diversity on a 
scale corresponding with the magnitude of the task. 
The technology permits the deep sequencing of a 
specific gene amplified from community DNA. The 
diversity of this gene is an indicator of the under¬ 
lying diversity of the community. The analysis of 
the few very large datasets generated using this 
technology has been relatively unsophisticated and 
the results only qualitatively confirm long-standing 
theoretical estimates that the diversity would be 
big. Quince and collaborators (Quince et al. 2008) 
recently presented a more rational and systematic 
approach to the exploration of microbial diversity 
in which significant samples were analysed using 
appropriate sample sizes. 

16.5.3 Metagenomics 

In principle the analysis of total DNA extracted 
directly from a community generates information 
derived from all the community members, provid¬ 
ing estimates of the microbial diversity and a broad 
picture of the native microbial community compo¬ 
sition. Diversity information stored in nucleic acids 
can give information at different resolution lev¬ 
els. DNA extracted directly from an environmen¬ 
tal sample can be regarded as one large microbial 
community genome—often called the metagenome. 
The metagenome approach represents a different 
whole community DNA-based analysis. Not only 
does it circumvent the cultivation anomaly, but 
it also avoids the PCR biases by cloning and 
sequencing genes directly from the environmental 
DNA. Metagenomics provides a relatively unbiased 
view not only of the community structure (species 
richness and distribution) but also the functional 
(metabolic) potential of a community. In practice 
such information is hard to obtain and hard to 
analyse. 

The first metagenomic analysis presented is that 
of Torsvik and collaborators (Torsvik et al. 1990b), 
in which the prokaryotic community from soil 


was analysed based on its DNA composition. This 
method makes the assumption that the extracted 
community genome contains all information about 
the diversity in that specific community. The total 
genetic diversity was therefore measured by rean- 
nealling of DNA, as this rate is dependent on the 
amount of homologous DNA in a given sample. 
This rate can then be used to calculate genome 
size or more precisely genome complexity in that 
sample, and is referred to as the community DNA 
reassociation method. The main advantage of the 
reassociation method is that it offers an estimate 
of the extent of diversity in the prokaryotic com¬ 
munity. The method basically provides informa¬ 
tion about the total number of completely differ¬ 
ent genomes present in the environmental sample 
but it also tells us something about the distribu¬ 
tion of different genomes based on the slope of the 
curve. This method is a broad-scale, low-resolution, 
cultivation-independent approach for assessing the 
diversity of microbial communities. 

Another metagenome approach involves con¬ 
struction of complex community libraries by direct 
cloning of large genomic DNA fragments (40-80 kb) 
from environmental samples into fosmid or bac¬ 
terial artificial chromosome (BAC) vectors. This 
approach can be used to generate information on 
the potential functioning of individual microbial 
species in natural environments in order to study 
the broader role of microorganisms in the ecosys¬ 
tem (Rondon et al. 2000; Tyson et al. 2004; Tringe 
et al. 2005). 

Although still in its infancy, such metagenomic 
analyses have already contributed to our knowl¬ 
edge of genome structure, population diversity, 
gene content and the composition of naturally 
occurring microbial assemblages. In low complex¬ 
ity populations, metagenomic studies have led to 
the assembly of almost complete genomes from the 
abundant genotypes and have provided compos¬ 
ite genomic representation of the abundant geno¬ 
type. Extreme environments such as acid mine 
drainage (Tyson et al. 2004), anaerobic methane oxi¬ 
dizing zones in the deep-sea (Hallam et al. 2004) 
and geothermal environments (Bhaya et al. 2007) 
have played a leading role in the metagenomics 
area. In many cases the restrictive geochemistry of 
these environments limits microbial phylogenetic 
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diversity. Such relatively low diversity systems 
are excellent templates for using metagenomics to 
explore the roles of microheterogeneiety in ecosys¬ 
tem resilience and stability. One of the most inter¬ 
esting studies has been focused on the marine 
environment: the largest metagenomic study to 
date is the Global Ocean sampling expedition, 
which followed the voyage of Darwin's ship HMS 
Beagle. 

Early metagenomic analyses provided some very 
significant insights into ecosystem function. Per¬ 
haps the most striking example is that of the dis¬ 
covery of proteorhodopsin. Proteorhodopsin are 
retinal-binding photoproteins that function as light- 
driven protonpumps. The genes encoding prote¬ 
orhodopsin were originally found in the marine 
environmental BAC library (Beja et al. 2001), 
resulting in the detection of a novel type of 
phototrophic organism in the ocean. Later com¬ 
parisons of genomic approaches have demon¬ 
strated that proteorhodopsin are widely distrib¬ 
uted among divergent bacterial taxa in marine envi¬ 
ronments and that this process contributes signifi¬ 
cantly to the phototrophic processes in the ocean. 
Metagenomic studies have also led to the recog¬ 
nition of the novel anaerobic methane oxidiza¬ 
tion metabolism or 'reverse methanogenesis' per¬ 
formed by Archaea in deep-sea sediments (Hal- 
lam et al. 2004) and to the prediction that non- 
thermophilic archaea from soil and marine plank¬ 
ton gain energy through the oxidation of ammonia 
(Schleper et al. 2005). 

Another approach is to look into the func¬ 
tional genes in a microbial community using pro- 
teomics. Proteomics provide direct information on 
the dynamic protein expression in tissue or whole 
cells. Together with the significant accomplish¬ 
ment of genomics and bioinformatics, the sys¬ 
tematic analysis of all expressed cellular compo¬ 
nents has become a reality in the post-genomic 
era, and attempts to grasp a comprehensive picture 
of the biology have become possible. One impor¬ 
tant aspect of proteomics is to characterize proteins 
differentially expressed by dissimilar cell types or 
cells imposed to different environmental condi¬ 
tions. In this way changes in protein expression 
under different growth conditions can be studied. 
Proteins induced under exposure to external stress 


factors, such as pH stress, nutrient limitation or 
the presence of metals, have been reported (Amaro 
et al. 1991; Seeger & Jerez 1992; Vera et al. 2003). 

In almost all of the metagenomic and proteomic 
studies a separate accompanying molecular typ¬ 
ing method—usually based on the PCR-amplified 
16S rRNA gene—is needed to characterize the gene 
discovery in the context of the microbial phyloge¬ 
netic diversity of the sample (Venter et al. 2004; 
Tringe et al. 2005; Gill et al. 2006; Huber et al. 2007). 
However, DNA-based metagenomic and diversity 
studies do not allow us to draw the conclusions 
on the expression state of the genes, and there¬ 
fore the functional role of genes or organisms 
in the investigated environments remains uncer¬ 
tain. A logical next step in the metagenomic area 
therefore includes metatranscriptomics technology 
(Urich et al. 2008). In this study the total RNA 
pool of a soil microbial community of both func¬ 
tional and taxonomically relevant molecules, i.e. 
mRNA and rRNA, was enriched and analysed 
in a 'double RNA' approach. In this way Urich 
and his co-workers were able to link commu¬ 
nity structure and function in one single exper¬ 
iment. Using this approach they found a high 
level of activity not withstanding the relative low 
abundance of the Crenarchaeota in the ecosystem 
(Urich et al. 2008). 

16.6 Sampling, scale, and thresholds 

Sampling is of central importance to the estimation 
of diversity at all living things (see Chapter 2 and 
3). However, the disparity between the number of 
individuals in the microbial world and the number 
of sequences we can examine deserves special con¬ 
sideration. The species patterns seen in a sample, 
even quite a large sample, may look nothing like 
the community from which they are drawn. The 
advent of much larger sample sizes has not neces¬ 
sarily obviated this problem (Fig. 16.3). This plainly 
casts doubt on the naive use of species abundance 
curves, diversity and similarity indices, and the 
other paraphernalia of macroecology. In addition to 
the problem of sample size, one must also consider 
the detection threshold. For clone libraries and anal¬ 
ogous methods such as pyrosequencing the thresh¬ 
old is simply the reciprocal of the size of the library. 
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Figure 16.3 Simulated samples (left column) from artificial communities (right column) with similar diversities and very different species abundance 
curves (uniform, log-series, and log-normal). The sample distributions are superficially similar even with an apparently large sample size of 15 000 reads 
(with thanks to Steve Woodcock for running the simulations). 


Thus a library of 1000 sequences has a detection 
limit of 1/1000. However, molecular fingerprinting 
techniques such as DGGE, ARISA and T-RFLP are 
subject to a detection threshold of about 1% abun¬ 
dance (Muyzer et al. 1993; Bent et al. 2007). Thus, 
when we naively attempt to infer changes in species 
richness using these methods we are really just eval¬ 
uating the number of taxa occurring above or below 


this threshold. The number of species above or 
below the detection limit the detection limit reflects 
a change in the distribution of taxa in the sample, 
which may, or may not, be associated with a change 
in richness. It could just be a change in distribu¬ 
tion. Moreover, whilst a reduction in the number 
of detectable taxa is assumed to be associated with 
a loss of diversity, it could also be associated with 
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an increase in diversity as the associated increase in 
evenness drops more taxa below the threshold. 

Thresholds and sampling issues cast a pall over 
all contemporary studies of microbial diversity 
(Woodcock et al. 2007). Microbial ecologists would 
do well to be cautious and qualified in their claims. 
It will be interesting to see how much of what we 
think we know will survive re-examination with 
adequate sample sizes supported by adequate sta¬ 
tistical analysis. 

16.7 Mathematical tools for estimating 
diversity 

It should now be apparent that we can virtually 
never measure diversity directly. Rather, it must be 
estimated or inferred from the data available to us. 
The excitement of the outstanding insights of the 
latest batch of sequencing technology should not 
blind us to the fact that these methods cannot, as 
currently deployed, directly determine the diversity 
of a community. However, the new generation of 
sequencing does mean that these estimates are now 
far more credible and can, in principle at least, be 
rationally verified. 

The mathematical tools we can use can be 
divided into four groups: 

1. collectors' curves 

2. Chao and Chao-type non-parametric estimators 

3. parametric estimators that assume a distribution 

4. parametric estimators that infer a distribution. 

16.7.1 Collectors curves 

A simple and intuitive way to evaluate a collec¬ 
tion of sequences is to plot the number of differ¬ 
ent sequences on the y-axis against the total num¬ 
ber of sequences examined. The asymptote of the 
curve is then estimated and taken to represent the 
total diversity (Colwell & Coddington 1994). This 
method, reviewed in detail in Chapter 4, is quite 
incapable of dealing with the scale of diversity of 
the microbial world. The logarithmic nature of the 
distribution of the microbial world means that the 
asymptote is apparently approached long before all 
the diversity is sequenced. This may not be obvious 
at first. Imagine a sample with 7000 species. Now 
imagine that species of modal rank abundance are 


10000 times less abundant than the most abundant 
taxon. When only half the taxa have been discov¬ 
ered, a new sequence is only going to be found 
once every 10 4 sequences. This would be a very 
flat collectors curve! The estimation or inference of 
such asymptotes has been very deceptive, leading 
authors to quite incorrectly believe they have nearly 
all the diversity in a sample (Roesch et al. 2007). 
These and allied methods are in principle and in 
practice a wholly inadequate way of determining 
species richness. 

16.7.2 Chao's non-parametric estimators 

There are a number of related methods, developed 
by Anne Chao, that are in many ways one of the 
intellectually most pleasing tools in the toolbox 
(Chao 1987). Their primary advantages are that 
they make mathematically infallible estimates of the 
lower bounds of the diversity (i.e. the diversity, 
whatever it is, cannot be less than the value esti¬ 
mated) and they are very simple to execute. This 
robustness arises because they make no assump¬ 
tion about the underlying distribution of the taxa 
(which is why they are termed non-parametric). 
This approach is described in more detail in Chap¬ 
ter 4 and exemplified by the Chao 1 method that 
determines St, a lower bound estimator for the total 
diversity, from S, the number of species observed in 
the sample and the number of taxa observed once 
(/i) and observed twice {fz): 

$r = S+^ (16.!) 

2 h 

There are a number of such methods but the esti¬ 
mates they make are all sensitive to the underlying 
distribution. This sensitivity to distribution can be 
quite profound. Schloss and Handelsmann (Schloss 
& Handelsman 2006b) used simulations to show 
that a sample with 5000 species and a log-normal 
distribution could require anything between 18000 
and 40 000 clones to find the species richness. How¬ 
ever, only 150 clones would be required if the dis¬ 
tribution was even. Thus, although in principle this 
estimator requires a sample size of at least the 
square root of twice the diversity, in reality much, 
much larger sample sizes are usually required. 
The sensitivity to distribution varies slightly from 
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method to method so it is better to choose one 
method and stick to it. For the microbial ecolo¬ 
gist, these are great, quick and dirty tools for the 
estimation of the lower bounds of diversity in a 
sample, as long as one remembers that this is a 
lower bound. At present it appears that, even with 
pyrosequencing and sample sizes of the order of 
tens or even hundreds of thousands these methods 
underestimate, but typically by less than an order of 
magnitude (Quince et al. 2008). It is not possible to 
use these methods to determine if you should take 
a bigger sample or to say how big a sample you 
should take in a subsequent study or to extrapolate 
to sample sizes for metagenomic studies. They will 
also be particularly sensitive to sequencing error, 
increasing the number of apparent singletons. This 
is a particular problem with the new generation of 
sequencing technologies. Thus, as a rule of thumb 
one can never go wrong with non-parametric esti¬ 
mators, equally one can never know if one is right. 
One can only be sure that the diversity, whatever it 
is, is very probably bigger than the estimate of these 
non-parametric estimators. 

16.7.3 Parametric estimators that assume 
a distribution 

Parametric estimators assume that the relative 
abundance follows a particular distribution. The 
best known is probably that of Curtis and cowork¬ 
ers (Curtis et al. 2002) and was inspired by, but 
is distinct from, the method of Preston mentioned 
in Chapter 4. This procedure assumes a log-normal 
taxa abundance distribution and is, in principle, 
very simple. A taxa abundance distribution self- 
evidently describes the number of taxa occurring 
at a given abundance. One way to describe this 
relationship is as a graph in which the abundance 
is plotted on the y-axis and the number of taxa at 
that abundance is plotted on the x-axis. The num¬ 
ber of species or taxa is simply the area under this 
curve. Thus, to determine the total diversity in the 
sample or environment of interest it is not neces¬ 
sary to count every species, merely to know the 
taxa abundance pattern and then determine the area 
under the curve. The disparity between the size of 
the microbial world and the sample sizes deployed 
to describe it is such that the no-one has directly 


observed the taxa abundance curve of any micro¬ 
bial sample or community (but see below). We can, 
however, assume a distribution and there are plau¬ 
sible reasons for assuming that species abundance 
curves for rapidly growing organisms are typically 
log-normal (MacArthur 1960; Curtis et al. 2002) and 
this distribution appears to be the most likely in 
at least some microbial communities (Quince et al. 
2008). 

To estimate the diversity of a community one 
must first estimate the area under the species abun¬ 
dance curve using the data available. Fortunately 
there is a relationship between the area under a 
species abundance curve (which we don't know) 
and the total number of individuals in a sample, 
which we do know. 

It is possible to show that there is a mathemat¬ 
ical relationship between the diversity of the sam¬ 
ple or environment (Sr) and the ratio of the total 
number of individuals in a sample (Nr) and the 
abundance of the most abundant taxon ( N, mx ). The 
one unknown in this relationship is the standard 
deviation of the species abundance curve, which 
is dictated by the abundance of the least abun¬ 
dant taxon or N,„,„ and is typically assumed to 
be one species at an abundance of one. This is 
patently a blind assumption, but the value of Sr is 
not very sensitive to modest departures from this 
guesstimate. 

The upshot of this approach is that one can, under 
these assumptions, infer the diversity of any com¬ 
munity or sample as long as one can estimate the 
total number of individuals the proportional abun¬ 
dance of the most abundant taxon. One can thus 
consider the diversity of an entire sea or a diversity 
of a drop thereof (Fig. 16.4). 

The estimates produced on this basis are crude 
ballpark estimates that lack statistical rigour. How¬ 
ever, they were and are consistent with the esti¬ 
mates of the Bergen group based on DNA:DNA 
hybridization showing that the diversity of soil 
massively exceeds the diversity of samples of sea¬ 
water and that the latter could easily exceed 3- 
4000 taxa in a few grams of soil (Torsvik et al. 
1990b). 

In principle the next generation of sequencing 
technology should obviate the need for assump¬ 
tions about taxa abundance curves. 
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Figure 16.4 The relationship between species richness and the total number of individuals over 30 orders of magnitude for various ratios of Nj/N max 
by assuming that N min is equal to 1 and a log-normal taxa area curve (Curtis et al. 2002). As a rule of thumb, soil has an Nj/Nmax ratio of 10 and seas 
and lakes a ratio of 4. There are about 10 30 bacteria in the world (Whitman et al. 1998). 


16.7.4 Estimating diversity by inferring a 
distribution from the data 

Of course, in an ideal world one would not assume 
or ignore the distribution of the data, but infer it 
from the data. However, to do so one must con¬ 
front the problem of sample size. If the sample size 
is too small there is so much uncertainty attached 
to the parameters of the distribution as to render 
the estimates useless. One would do far better to 
fall back on assumed distributions or simpler non- 
parametric methods. It is only very recently that 
sample sizes have become large enough to permit 
credible estimates to be made. An early attempt to 
circumvent the issue by fitting a taxa abundance 
curve to reassociation data (Gans et al. 2005) gave 
very large (•—10 6 species per gram) estimates of 
diversity. It was subsequently criticized because the 
variance in these estimates proved too high to allow 
a meaningful estimate to be made (Bunge et al. 2006; 
Hong et al. 2006). 

Robust methods for determining the diversity 
from robust estimates of the distribution of the taxa 
rest on approaches pioneered by (Chao & Bunge 
2002), who looked at the y distributions of species. 


Hong, Bunge and colleagues applied this approach 
to microbes and considered a wider variety of dis¬ 
tributions (Hong et al. 2006). They could only obtain 
a suitable fit by using a subset of the data avail¬ 
able to them, a stratagem that was perhaps nec¬ 
essary because their dataset was also too small. 
Most recently Quince et al. (2008) have exploited 
the new generation of pyrosequencing data and 
extended Bunge's likelihood approach and demon¬ 
strated how it can be used to determine the sample 
sizes required to capture a specified fraction of the 
abundance of a particular gene or indeed all the 
genetic material in an environment. The microbial 
literature has been well reviewed by Sloan et al. 
(2008) and Connolly and Domelas have written a 
very accessible introduction to Bayesian methods in 
Chapter 10. Those seeking a deeper understanding 
of Bayesian statistics can find guidance elsewhere 
(Gelman 2003). 

The core of the new approach is the use of a like¬ 
lihood function. A likelihood function in essence 
allows the estimation of particular model parame¬ 
ters from a given distribution given a particular 
dataset (in our case observed taxa abundances). 
The beauty of such an approach is that it not only 
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allows one to estimate the parameters of a par¬ 
ticular species distribution and thus the diversity, 
but it also allows one to estimate the uncertainty 
around those parameters and thus the diversity 
estimated. However, what a likelihood function 
cannot tell you is which distribution to choose. 
That is still a question of judgement or intuition. 
Nevertheless, one's choice of distribution can be 
evaluated. 

This likelihood function can then be used in con¬ 
junction with Monte-Carlo-Markov chain proto¬ 
cols to generate a distribution of the model parame¬ 
ters given the data. The choice of species abundance 
distribution is at present arbitrary. A comparison of 
number of data sets by Quince et al. (2008) found 
the log-normal to give a good fit for many, but 
not all, data sets and the inverse Gaussian, log-t, 
and Sichel all gave good fits in some cases. How¬ 
ever, the exponential, y, and a mixture of expo¬ 
nential were found to be unsatisfactory. Not sur¬ 
prisingly, the estimates of diversity tended to con¬ 
verge to within a factor of 2 in the better sam¬ 
pled environments (Fig. 16.5). This suggests that the 
best strategy is to take a large enough sample and 
choose from a range of distributions to obtain a 
satisfactory fit. 


16.8 Estimation of required sample size 

Once one has some idea about the distribution one 
can make an estimate of the sample size required 
to obtain a particular fraction of the diversity. A 
conceptually simple way to do this is to generate 
artificial communities with the best estimated taxa 
abundance distributions and sample from them 
(Schloss & Handelsman 2006b). This approach is 
laborious and requires the size of the community to 
be defined. A slightly more elegant approach is to 
exploit the Bayesian approach to examine the effect 
of sample size on the unobserved proportion of the 
taxa (Chao & Bunge 2002). 

Such estimates are very dependant on an abil¬ 
ity to determine the true nature of the species 
abundance curve, and thus are sensitive to under¬ 
sampling and maybe substantially distorted by 
sequencing error (Fig. 16.5). 


16.9 In-depth metagenome analyses 

Traditionally, metagenomic studies have been 
something of a shot in the dark. However, if we 
know, or think we know, the abundance distrib¬ 
ution of a sample we can determine how many 
reads are required to capture a specified fraction of 
the metagenome and/or to capture all the genetic 
material of an organism of specified abundance. 
The procedure is described in greater detail else¬ 
where (Sloan et al. 2008; Quince et al. 2008) but it 
is a very simple extension of the estimation of the 
sample size for a specified fraction of the diversity. 
If the mean genome length is G and it is sequenced 
with a method yielding genome fragments of DNA 
(reads) of size R then the number of reads required 
to span a genome y is G/R. Quince et al. (2008) 
show that to calculate the expected sample size to 
capture a given fraction of the genomic material one 
must simply multiply y by the expected sample size 
required to capture the same fraction of the species 
diversity. Thus the best-fitting distribution for the 
Global Ocean Survey that 90% of the diversity could 
be captured with 35 million reads. The estimates 
are highly sensitive to the underlying taxa abun¬ 
dance curve. If the underlying distribution was log¬ 
normal nearly 10 10 reads might be required to cap¬ 
ture most of the genomic material. The ability to 
correctly determine the taxa abundance patterns 
will be of strategic importance to the rational explo¬ 
ration of the microbial world in years to come. 

16.10 Prospectus 

For the last 30 years microbial diversity stud¬ 
ies have relied on molecular tools to liberate 
this field from the constraints of culture and 
we have marvelled at what they have found. 
In the coming decades the new generations of 
sequencing technology and associated mathemat¬ 
ical and computational developments will lib¬ 
erate us from the woefully inadequate sample 
sizes of the first generation of molecular meth¬ 
ods. In so doing they will allow us to put the 
exploration of this frontier on a rational basis. 
We do not yet know what we will find; most 
likely new and fascinating molecules and processes 
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Figure 16.5 Upper panel: Bayesian parametric diversity estimates from fits of abundance distributions of samples from the sea (Global Ocean Survey), a 
Canadian soil, and a deep-sea vent. Estimates are given as medians with a 95% confidence interval. LN, log-normal; IG, inverse Gaussian; S, Sichel. The 
Chao estimates (C) and are also shown. Lower panel: the sampling effort (as 16S reads) necessary to sample 90% of taxa present. Adapted from Quince 
et al. (2008). The deep-sea vent is sample FS 396B in this paper. 


will be uncovered. We do know that the unexplored 
exceeds the explored by orders of magnitude. It 
seems likely that much of what we think we know 
will be changed and revaluated as new molecules 
and patterns emerge and sampling artefacts are 


seen for what they are. We need to create a new gen¬ 
eration of numerate and computer savvy molecular 
microbial ecologists to explore this immense fron¬ 
tier. They will no doubt regard much of which has 
done in the past 30 years as quaint and primitive. 
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16.11 Key points 

1. Microbial ecology is one of the most awe¬ 
inspiring frontiers in any science, and 
microbial diversity estimates are just the 
beginning. 

2. The traditional species concept is useless in 
microbial ecology; we must draw on more 
abstract and arbitrary measures of evolution. 

3. We use evolutionary relationships inferred from 
changes in molecules to determine differences 
between microorganisms. 

4. rRNA is the molecule of choice for inferring phy- 
logeny. However, it gives only a partial view 
of the diversity of a community, the true extent 
of which can only be inferred by considering all 
of the genetic material. 


5. Microbial diversity studies have shifted from a 
culture-based approach to a molecular view of 
microbial diversity and the biosphere. 

6. Sampling is of central importance to the esti¬ 
mation of microbial diversity, yet the disparity 
between the number of microbes and the num¬ 
ber of sequences we can examine deserves spe¬ 
cial consideration. 

7. Diversity can virtually never be measured 
directly, but rather it must be estimated or 
inferred from available data. Our estimates are 
anchored in the sample itself and the community 
or environment from which it was drawn. 

8. The advent of novel massive parallel sequenc¬ 
ing could finally place the rational exploration 
of microbial diversity on a scale corresponding 
with the magnitude of the task. 



CHAPTER 17 


Biodiversity and disturbance 

Maria Dorneias, Candan U. Soykan, and Karl Inne Ugland 


17.1 Introduction 

Disturbances play a fundamental role in shap¬ 
ing biodiversity patterns and ecosystem processes. 
Some disturbances completely destroy a commu¬ 
nity by killing all of its inhabitants. However, peri¬ 
odically disturbing ecological communities may 
enhance biodiversity by releasing resources (Holt 
2008) and by promoting the coexistence of species 
adapted to different conditions (Connell 1978). 
Given the importance of biodiversity for ecosystem 
functioning (Hooper et al. 2005) and specifically 
for ecosystem resilience (Folke et al. 2004), accu¬ 
rately measuring the effects of disturbances on bio¬ 
diversity is a fundamental step in the management 
of natural resources and conservation efforts. Here 
we review biodiversity metrics and their ability to 
detect the effects of disturbances. 

Considerable effort has gone towards measuring 
the effects of disturbances on ecosystem properties, 
with many metrics being developed for quantifying 
ecological integrity (Rice 2000; Costello et al. 2004; 
Jorgensen et al. 2005). There are available metrics 
for properties at different levels of organization, 
ranging from individual health to ecosystem func¬ 
tion. Given the focus of this book on species diver¬ 
sity, we restrict our discussion to community-level 
metrics, noting that a considerable body of work 
addresses individual-, population-, and ecosystem- 
level metrics (see Jorgensen et al. (2005)). Even at 
the community level, the sheer number of choices 
available makes it difficult to select an appropri¬ 
ate metric or suite of metrics for a specific pur¬ 
pose. Moreover, a lack of rigorous comparative 
studies means that the relative merits of the full 
range of community-level metrics is yet to be deter¬ 
mined (but see Abella et al. (2006), Patricio et al. 
(2006), Salas et al, (2006a), Salas et al. (2006b), and 


Ware et al. (2008) for examples comparing selected 
metrics). 

Although it represents an important research 
objective in the field of applied ecology, a full eval¬ 
uation of the various community-level metrics used 
to measure the effects of human disturbance on 
ecological communities is beyond the scope of this 
chapter. Instead, our objectives are to organize exist¬ 
ing metrics around a logical framework, describe 
their strengths and limitations, highlight important 
areas for future research, and make some recom¬ 
mendations based on current knowledge. Along 
these lines, in the sections that follow we group the 
various metrics into three categories with increas¬ 
ing preservation of information: univariate metrics, 
abundance distribution-based metrics, and multi¬ 
variate analyses. From each of these categories we 
describe selected metrics that are commonly used, 
effective, and/or novel. 

17.2 What is a disturbance? 

Before we can measure the effects of disturbance 
on biodiversity, it is useful to consider what we 
mean by disturbance. The term 'disturbance' is 
broadly used, and has a variety of meanings. One 
definition is that an ecological disturbance is any 
condition that causes mortality or loss of bio¬ 
mass (Huston 1994). Another definition includes 
any event that kills, displaces, or damages indi¬ 
viduals (Sousa 1984). Often a disturbed ecosystem 
is thought of as one damaged by anthropogenic 
actions, and disturbance is used interchangeably 
with human threat. Here we use the broader mean¬ 
ing of disturbance and consider both 'natural' and 
anthropogenic disturbances, although most of our 
examples are linked to human actions. We also 
try to refrain from assigning any judgment about 
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whether disturbances are 'good' or 'bad' for biodi¬ 
versity, and focus merely on measuring their effects. 
Ultimately, disturbances temporarily change the 
rules that govern community dynamics, poten¬ 
tially affecting biodiversity. However, disturbances 
come in many forms and influence communities 
in different ways. We will begin by briefly dis¬ 
cussing an overview of disturbances according to 
their source, time scale, spatial scale, intensity, and 
specificity. 


17.2.1 Source of the disturbance 

Disturbances may be classified as physical, chemi¬ 
cal, or biological. Examples of physical disturbances 
include fire, waves, drought, floods, ice, and wind. 
Chemical disturbances are usually associated with 
the presence of a substance the community is not 
adapted to. Examples include most types of pol¬ 
lutants, as well as freshwater in a marine envi¬ 
ronment, or salt water in a freshwater community 
(although neither would really be a disturbance in 
an estuarine environment). Biological disturbances 
include population outbreaks of predators, com¬ 
petitors, or diseases, as well as the introduction of 
invasive species. 

Disturbances are often classified as either nat¬ 
ural or anthropogenic. Some disturbances easily fall 
under one of these classes. For example, a volcanic 
eruption is clearly a natural disturbance and an oil 
spill an anthropogenic one. However, the distinc¬ 
tion is not always obvious: forest fires can either be 
a natural disturbance or the result of human action; 
a cyclone is a natural phenomenon, but human 
effects on climate change seem to be increasing their 
frequency and intensity (Schiermeier 2005). Fur¬ 
thermore, human action sometimes prevents nat¬ 
ural disturbances, like floods for example, and this 
also has important ecosystem consequences. In fact, 
given the influence humans have on the planet it is 
increasingly difficult to find pristine communities 
and to know whether or not a disturbance is 'nat¬ 
ural'. Hence, from a practical point of view it may 
be more useful to classify disturbances according 
to whether or not humans can manipulate them 
to maximize biodiversity as well as ecosystem ser¬ 
vices. In this review we will focus primarily on 


disturbances that are either produced by or can be 
manipulated by humans. 

17.2.2 Timescale 

As ecologists became aware that communities are 
continually being perturbed and rarely if ever are 
at equilibrium (Connell & Sousa 1983), the con¬ 
cept of disturbance evolved from that of a discrete 
event to that of a disturbance regime. Conse¬ 
quently, timescale is now recognized as an essen¬ 
tial facet of disturbance characterization. There are 
two main aspects of disturbance timing to be taken 
into consideration: duration and frequency. Dura¬ 
tion regards how long the disturbance affects a 
community. Duration is a continuous variable, with 
disturbances at the two extremes often known as 
press (chronic disturbances with very long dura¬ 
tions) and pulse (short and discrete disturbances) 
(Bender et al. 1984). Arguably, the former even¬ 
tually can no longer be considered a disturbance, 
once the community acclimatizes to the new con¬ 
ditions, but the resulting community may be very 
different from the one before the disturbance. Dis¬ 
turbance frequency is how often a community is 
disturbed. Both duration and frequency determine 
the effects of disturbances on biodiversity pat¬ 
terns. For example, reef corals react to temperature 
stress by expelling their symbiotic algae, a process 
known as coral bleaching. Bleaching events that are 
either very long or very frequent can result in the 
death of an entire reef, whereas corals can recover 
from shorter bleaching events given enough time 
between them (Anthony et al. 2009). 

17.2.3 Spatial scale 

The effects of a disturbance, as well as the com¬ 
munity's ability to recover, are strongly linked 
to its spatial scale. Some disturbances are local¬ 
ized, some are patchy, and some are regional. 
Large and spatially homogeneous disturbances 
usually have detrimental effects on biodiversity, 
whereas smaller or patchy disturbances can actu¬ 
ally enhance regional diversity (Huston 1994). In 
fact, the homogenization of environmental con¬ 
ditions in modified landscapes such as agro¬ 
ecosystems can be a threat to biodiversity (Dornelas 
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et al. 2009). Most importantly, spatial scale deter¬ 
mines how effective management tools can be in 
preventing and/or protecting a community from 
disturbances. For example, reserves can effectively 
deter hunting or fishing within them, but do not 
protect communities from pollution dispersed from 
other areas or from climate change. 

17.2.4 Intensity 

Communities are constantly undergoing all sorts of 
perturbations, both ecological and environmental. 
We can think of disturbances as intense perturba¬ 
tions with significant consequences to the commu¬ 
nity. The intensity of a disturbance can be measured 
as the strength of the disruptive force in action 
or as the extent of the damage it creates (Sousa 
1984). The more intense a disturbance is, the greater 
its effects on biodiversity patterns. However, even 
apparently subtle changes in the community can 
have drastic effects on ecosystem services (Hooper 
et al. 2005). Because species abundances are typi¬ 
cally highly asymmetric, the loss of a single species 
may correspond to dramatic reductions in total bio¬ 
mass (Gaston & Fuller 2008). By the same token, the 
loss of only a small proportion of individuals can 
correspond to the disappearance of many species 
because most species are rare. We therefore need 
powerful biodiversity metrics to be able to detect 
even subtle effects of disturbances. 

17.2.5 Specificity 

Species vary in their ability to withstand distur¬ 
bances, and disturbances also vary in whether they 
affect most species or only some. Some distur¬ 
bances, like a fire, are relatively indiscriminate in 
which species are affected. Others, such as disease 
outbreaks, affect a single species. Hence, the effects 
of disturbances on biodiversity sometimes are dras¬ 
tic reductions in the number of species, other times 
are shifts in the identity of the species that domi¬ 
nate the community, and often are a combination 
of the two. Additionally, disturbances often have 
both direct and indirect effects. For example, a fire 
in a forest may directly remove trees and indirectly 
affect all the species that depended on the trees for 


habitat or that are able to exploit the resources freed 
by the tree removal. 

17.2.6 Summary 

Different disturbances lead to different effects on 
biodiversity, therefore it is not surprising that there 
is no single ideal biodiversity metric to detect 
the effects of disturbances. When measuring these 
effects it is important to be as specific as possible 
about the nature of the disturbance being inves¬ 
tigated, and to make predictions about the effects 
we expect it to have on the community. Specifying 
these expectations allows the choice of appropri¬ 
ate biodiversity metrics to detect the effects of the 
disturbance. In the next section we will explore the 
available methods for detecting such effects. 

17.3 State of the field: measuring the 
effects of disturbance on biodiversity 

Biodiversity metrics can be organized according 
to how much information is preserved. Univariate 
metrics are the easiest to use and the most general, 
but retain the least information by concentrating 
it into a single value. Metrics based on abundance 
distributions attempt to retain more information 
while remaining general by ignoring species identi¬ 
ties (and consequently allowing the comparison of 
communities in different biogeographic regions and 
even habitats). Finally, multivariate metrics retain 
the most information, including species identities, 
and therefore allow the detection of compositional 
changes. In this chapter we review available met¬ 
rics and discuss their advantages and disadvan¬ 
tages (Table 17.1). We do not attempt an exhaustive 
overview of all metrics available, but focus instead 
on representative examples. 

17.3.1 Univariate metrics 

Univariate metrics are often grouped into 
three main categories: species richness-, species 
evenness-, and species diversity-based metrics. We 
refrain from a detailed description of how they 
can be calculated because Chapters 4 and 5 are 
dedicated to species richness estimation and the 
comparison of diversity and evenness measures, 
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Table 17.1 Examples of methods available to quantify the effects of disturbance on biodiversity. 


Method 

Advantages 

Disadvantages 

Comments 

Useful software 

Univariate 

Simple to interpret, easy to 

Insensitive so more subtle 

Species richness and 

Estimates, Primer, EcoSim 

Species richness 

Species evenness 

Species diversity 
Phylogenetic diversity 

Trait diversity 

calculate, general 

changes 

evenness complement one 

another 

Diversity indices may mask 
differences in components 
Phylogenetic and trait 
diversity are promising 


SADs 

General, more powerful, 

Many approaches proposed 

Promising area with more 


Non-parametric 

flexible, allow testing for 

but not standardized, not 

methods emerging 


Slope of rank abundance 
eCDF 

k dominance 

ABC curves 

Q statistic 

Size spectra 

Parametric 

Log-series a 

Log-normal 

Gambin a 

several different effects: 
rare species, dominant 
species, community 

structure 

often used 

Comparisons among 
methods needed 


Multivariate analysis 

Powerful, flexible 

Site specific and sometimes 

R mode allows the 

Vegan package in R 

R-mode approach 

Allow testing for several 

difficult to interpret 

identification of the 

Primer 

Q-mode approach 

different effects: rare 
species, dominant species, 
species composition 


contribution of different 
species to overall patterns 

Q mode approach often 
yields more interpretable 
results 

CAN0C0 


respectively. To these three categories we add 
phylogenetic and trait diversity, discussed at length 
in Chapters 13 and 14, respectively, but worthy 
of mention because of their potential application 
in environmental assessment (Warwick & Clarke 
1998; Rosenfeld 2002). 

Just as it is the most commonly used diver¬ 
sity metric in basic studies, so too species richness 
is the most popular of the metrics used to mea¬ 
sure the effects of human disturbance on ecolog¬ 
ical communities (Costello et al. 2004). Although 
species richness often declines at high levels of dis¬ 
turbance, several issues interfere with its perfor¬ 
mance. Firstly, under certain circumstances richness 
is relatively insensitive to disturbance, often show¬ 
ing little change in spite of considerable human 
influence (Chapin III et al. 2000; Hillebrand et al. 
2008). Secondly, richness often follows a unimodal, 


or hump-shaped, pattern, peaking at intermediate 
levels of disturbance (Grime 1973a; Horn 1975; Con¬ 
nell 1978; Huston 1979; Sousa 1979). Finally, species 
richness estimates are proportional to the area sam¬ 
pled, sample size, and the number of samples col¬ 
lected, complicating comparisons among sites that 
differ in sampling intensity (Gotelli & Colwell 2001; 
Magurran 2004; see also Chapter 4). These caveats 
aside, its ease of calculation (only presence-absence 
data are required) and its intuitive appeal make 
species richness a fundamental biodiversity metric 
that should be used in studies on the effects of 
disturbances. 

Species evenness—the relative asymmetry in 
abundance of species in a community—responds 
more rapidly to human disturbance than species 
richness (Hillebrand et al. 2008). Moreover, empiri¬ 
cal evidence that evenness metrics complement pat- 
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terns revealed by species richness is accumulating 
(Death & Zimmerman 2005; Pillans et al. 2007; see 
the example in Box 17.1). The momentum for the 
use of evenness indices has also come from an 
influential review by Smith & Wilson (1996), who 
identified a set of preferred metrics. Specifically, 
the authors conclude their study by recommend¬ 
ing four indices, Simpson evenness, Carmago even¬ 
ness, the Nee et al. index (£q), and what many 
are now calling the Smith-Wilson evenness index 
(£„„, ), as best meeting a set of required and desir¬ 
able criteria for any index that purports to mea¬ 
sure evenness. The combination of these factors has 
piqued interest in measuring changes in evenness 
due to human disturbance, encouraging numerous 
authors to include an evenness measure in their 
study (see, for example, Stewart et al. (2005), Todd 
(2006), and Dampier et al. (2007)). Empirical studies 
that report changes in evenness are accumulating, 
but the implications and causes of such changes 
remain poorly understood. An important, and as 
yet nearly unexplored, research theme will involve 
developing a theoretical framework for explain¬ 
ing the patterns that emerge, as well as predicting 
future changes in evenness as a result of human 
activity (see Drobner et al. (1998) for a promising 
first step). 

Species diversity (sensu Hurlbert (1971)), also 
referred to as heterogeneity (sensu Peet (1974)), 
indices attempt to combine evenness and species 
richness into a single entity. A number of diver¬ 
sity metrics have been developed with this goal 
including the commonly used Shannon index (other 
popular ones include Shannon evenness, Simpson's 
dominance. Hill's diversity indices, and the Bril- 
louin index). These metrics differ in the degree to 
which they emphasize richness versus evenness 
(Huston 1994; Magurran 2004), but all confound 
them to some degree. As a result different com¬ 
munity states can have the same value of a given 
diversity metric, and these metrics as a whole have 
been shown to perform poorly under a variety of 
circumstances (Magurran 2004). In spite of decades 
of admonitions against the use of these metrics (see, 
for example, Hurlbert (1971)), they remain popu¬ 
lar. So that there is no confusion in the reader's 
mind, we urge caution when using any univariate 
metric that combines richness and evenness into a 


single value—the two components of diversity are 
often orthogonal and should usually be treated as 
such (Stirling & Wilsey 2001). In Box 17.1 we show 
how the two metrics complement each other and 
provide an example where species richness allows 
the detection of the effects of one type of distur¬ 
bance, whereas evenness allows the detection of 
another. 

Recognizing the fact that heterogeneity mea¬ 
sures combines species richness and evenness, 
Buzas & Hayek (1996) developed a method to 
decompose the Shannon diversity metric into its 
two components, species richness and evenness. A 
major advantage of this approach is that it allows 
researchers to interpret changes in diversity. Thus, 
a decrease in diversity as a result of human activ¬ 
ity can be attributed to a loss of richness, even¬ 
ness, or both. Termed SHE analysis, this decom¬ 
position can also be used to analyse changes in 
community structure along a gradient (Buzas & 
Hayek 1998). Specifically, as samples are accumu¬ 
lated, species richness will increase; this, in turn, 
will affect values for Shannon diversity and even¬ 
ness. The ways in which diversity and evenness 
change along a gradient can be used to look for 
breaks in the pattern that indicate a change in the 
community. 

The phylogenetic metrics of biodiversity were 
developed in a bid to incorporate differences 
among species and evolutionary history into bio¬ 
diversity metrics. These metrics and their relative 
performance are reviewed by Vellend et al. in Chap¬ 
ter 14. Typically, the magnitude of phylogenetic 
diversity (estimated from the position of species 
in a phylogenetic tree) of a community or set of 
communities is compared with that of a reference 
(the regional phylogenetic diversity). One example 
of this type of metrics that has been used to detect 
the effects of disturbances is taxonomic distinct¬ 
ness, which uses taxonomic hierarchy as a proxy 
for phylogenetic information (Clarke & Warwick 
1998; Warwick & Clarke 1998). Taxonomic distinct¬ 
ness tends to decline with increasing human distur¬ 
bance, is extremely robust in the face of variations 
in sampling effort, requires only incidence data, 
and is one of the few phylogenetic measures that is 
unbiased by species richness (Schweiger et al. 2008). 
A drawback is the fact that taxonomic distinctness 
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Box 17.1 The complementary nature of species richness and evenness 


Death & Zimmerman (2005) studied aquatic invertebrate 
biodiversity patterns in 10 New Zealand streams that 
differed in substrate movement (a natural disturbance). For 
each stream they compared communities under native forest 
canopy and at sites surrounded by grassland (a human 
disturbance). Their main conclusion, that 'invertebrate 
communities in these mountain streams were strongly 
influenced by both the stability of the streambed and 
the presence or absence of a forest canopy', was influenced 
by their use of multiple, complementary diversity metrics. 

For each sample these authors calculated rarefied 
species richness (species richness estimates standardized 
for sampling effort (see Chapter 4) and evenness (using the 


Figure 17.1 (a) Rarefied species richness for aquatic invertebrates 
along a gradient of natural disturbance (measured using stone 
movement); closed circles represent sites with a native forest canopy, 
while open circles represent open grassland sites, (b) Species 
evenness along the same gradient. Note that rarefied richness 
declines with increasing natural disturbance for both types of sites, 
however, it does not differ between closed and open sites. In 
contrast, evenness differs between closed and open sites, but only 
declines with increasing natural disturbance at the open sites. 


Smith and Wilson evenness index, E m (see also Chapter 5). 
Species richness allowed the authors to detect an effect of 
increasing natural disturbance (measured as stone 
movement). However, rarefied richness did not differ 
between forest canopy and grassland sites (Fig. 17.1a) and 
therefore did not allow the detection of the human 
disturbance. E var differed significantly between forest 
canopy and open grassland sites, with greater evenness 
under undisturbed native forests (Fig. 17.1b). This example 
illustrates well the complementary nature of species 
richness and evenness, therefore demonstrating the value 
of measuring both rather than combining them into a single 
metric. 


(a) 
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is often less sensitive than other indices (Clarke 
& Warwick 1998) and may be influenced by nat¬ 
ural environmental gradients (Abella et al. 2006). 
Despite its faults (see Hall & Greenstreet (1998), 
Abella et al. (2006), Salas et al. (2006b), and Ware 
et al. (2008)), it is a promising metric for the 
assessment of disturbances to ecological commu¬ 
nities (Clarke & Warwick 1998). Because it mea¬ 
sures a fundamentally different aspect of diversity, 
taxonomic distinctness should be seen as comple¬ 
menting, rather than replacing, the other metrics 
described in this section. 

An alternative way of incorporating differ¬ 
ences among species into biodiversity metrics is 
trait diversity. Under this approach evolutionary 
and taxonomic relationships are ignored, and we 
focus instead on species functional traits, link¬ 
ing community dynamics to ecosystem function¬ 
ing (Hooper et al. 2005). Historically, trait (or 
functional) diversity was estimated by aggregat¬ 
ing species into groups representing distinct func¬ 
tional niches (Rosenfeld 2002). Because this clas¬ 
sification is necessarily a subjective step, recently 
numerous alternative approaches have emerged 
(e.g. Petchey & Gaston 2002; Cornwell et al. 2006; 
Podani & Schmera 2006) and their relative merits 
are discussed by Weiher in Chapter 13. As with 
species and taxonomic diversity, trait diversity can 
be decomposed into functional richness, evenness, 
and diversity (Mouillot et al. 2005). Functional rich¬ 
ness is the amount of functional space occupied by 
the species of the community, functional evenness 
of a community depends on how regular species 
positions are in trait space, and functional diver¬ 
sity incorporates species abundance to weigh their 
contribution to the community attributes. The novel 
metrics of trait diversity are leading to the rapid 
expansion of this field. For example, recent studies 
have shown how trait diversity buffers ecosystem 
functioning from human perturbations (Hooper 
et al. 2005). Using trait diversity to quantify the 
effects of disturbances is a promising approach, 
which we predict will be fruitful. 

The metrics described in this section complement 
each other, as they quantify different aspects of bio¬ 
diversity, We can illustrate this with a hypotheti¬ 
cal example. Let's imagine a plant assemblage with 
four species: an abundant grass, a common shrub. 


and two rare mosses. Let's imagine that an inva¬ 
sive shrub (phylogenetically distant from all the 
other species) is introduced in this assemblage and 
becomes the dominant species by out-competing 
the grass and the other shrub. In this scenario 
species richness and phylogenetic diversity would 
increase (because of the invasive species), evenness 
would decrease (because the assemblage would 
have one very abundant and four rare species), 
and trait diversity would decrease (because the 
community becomes dominated by the invasive 
shrub). 

17.3.2 Species abundance distribution-based 
metrics 

As with the univariate indices, species abundance 
distributions (SADs) are covered in other chap¬ 
ters of this book. These metrics are discussed by 
McGill in Chapter 9 while Connolly and Dornelas 
(Chapter 10) describe how to best fit models to 
SADs. Here we discuss non-parametric metrics 
based on SAD shape and the application of para¬ 
metric models to the study of human disturbance. 
We start by discussing non-parametric measures, 
namely the slope of the rank abundance dis¬ 
tribution, the empirical cumulative distribution 
function, k dominance plots, abundance/biomass 
comparison (ABC) curves and size spectra. We then 
focus on the use of models fitted to SADs to detect 
effects of disturbances and specifically discuss the 
log-series, log-normal and gambin models, as well 
as the Q statistic (not to be confused with Q-mode 
multivariate analysis, which is described in the fol¬ 
lowing section). 

One of the first approaches suggested for com¬ 
paring communities was the shape of rank abun¬ 
dance distribution (RAD) plots (Whittaker 1965; 
Frontier 1985), also known as dominance/diversity 
curves (Wilson 1991). RAD plots have the ranks 
of the species plotted on the x-axis in descend¬ 
ing order (from most abundant to least abundant) 
and their abundances plotted on the y-axis (Fig¬ 
ure 17.2a). The slope of the RAD measures even¬ 
ness, with a shallower slope indicative of a more 
even distribution of abundances among species. 
While they offer a useful and appealingly simple 
visual check of community structure, the slopes 
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(a) 


(b) 





Figure 17.2 Graphical approaches for measuring species diversity, (a) Rank abundance distributions (RADs); (b) the empirical cumulative distribution 
function (eCDF); (c) abundance biomass comparison (ABC) curves; (d) the 0 statistic, calculated as the interquartile slope of the cumulative species 
abundance curve. 


of RAD curves are influenced by the number of 
species in the community, confounding interpreta¬ 
tion (thus, it is really a diversity index, rather than 
an evenness index). We therefore caution against 
the use of RAD slopes to compare communities that 
differ in species richness. Note, however, that an 
evenness index proposed by Nee et al. (1992) and 
modified by Smith and Wilson (1996), Eq, corrects 
for the problems encountered with the slope of the 
RAD. Specifically, ranks are scaled by dividing by 
the maximum rank, and independence from species 
richness is achieved by taking the inverse of the 
usual RAD slope (Smith & Wilson 1996). 

The empirical cumulative distribution function 
(eCDF) (Chapter 9) is a graphical method that uses 


the visual appeal of RADs as inspiration, while 
addressing their analytical shortcomings. The CDF 
is commonly used in mathematics and quantitative 
sciences. It is interpreted as the proportion of points 
with a value (here abundance) less than a given 
value. The eCDF is generated by sorting species 
from smallest to largest abundance, and then cal¬ 
culating the cumulative abundances. Both axes 
are standardized by dividing by species richness 
and total abundance (Figure 17.2b). In so doing, a 
richness-independent, mathematically familiar, yet 
conceptually simple graph is generated for com¬ 
paring ecological communities along gradients of 
human activity (Soykan et al. in preparation). In 
particular, both the slope of the eCDF and the inter- 
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cept of the inflection point can be used to compare 
curves of disturbed and undisturbed communities. 
The slope is indicative of evenness, whereas the 
intercept of the inflection point indicates the frac¬ 
tion of rare species. Fig. 17.2b shows how the eCDF 
curve changes through a gradient of disturbance 
from an undisturbed forest to an agriculture plot. 

Another approach that builds on the RAD con¬ 
cept is the k-dominance plot, where the cumulative 
abundance of species in an assemblage is plotted 
against log-species rank (Lambshead et al. 1983; 
Platt et al. 1984). As with the RAD, the species are 
ranked along the x-axis from most to least abun¬ 
dant, but it is their cumulative abundance that is 
plotted, rather than their individual abundances. 
Noting that dominance is often affected by pol¬ 
lution; Platt et al. (1984) demonstrate the applica¬ 
tion of k-dominance plots for discriminating among 
meoifaunal assemblages exposed to different levels 
of pollution. Plots that are straighter (less convex) 
indicate a community with greater evenness that 
has been less disturbed than plots that are curved 
(more convex). In general, if the curve representing 
one assemblage lies entirely above another, the 
former assemblage can be said to be less diverse 
(and thus more affected by human activity) than 
the latter (Lambshead et al. 1983). However, com¬ 
plications arise when the two curves cross, mak¬ 
ing it impossible to say which community is more 
diverse. Lambshead et al. (1983) insightfully point 
out that such a situation results in different univari¬ 
ate diversity indices producing conflicting diver¬ 
sity rankings (because they weight different aspects 
of diversity differentially, i.e. abundant versus rare 
species). 

One variation of the k-dominance plot, ABC 
curves does not rely on the presence of multi¬ 
ple communities for comparison (Warwick 1986). 
Instead, two k-dominance curves are generated for 
each assemblage, one based on abundance and 
one on biomass. The two curves are then com¬ 
pared and their relative positions used to infer the 
state of the community (Fig. 17.2c). Several met¬ 
rics, including the W, d, and R statistics, have been 
developed to quantify the difference between abun¬ 
dance and biomass curves (Clarke 1990). The W 
statistic, which measures the total distance of each 
curve from the x-axis, has become the most pop¬ 


ular (although Clarke himself recommends the i 
or R statistics, which measure the area between 
the curves, suggesting that they are more likely to 
detect differences between communities). The value 
of W ranges between -1 and 1, with positive values 
indicative of undisturbed communities and nega¬ 
tive values indicative of disturbance. Fig. 17.2c illus¬ 
trates the use of ABC curves to detect the effects of 
disturbances. 

Related to k-dominance plots are what is known 
in the fisheries literature as community size spec¬ 
tra (Rice 2000) and in the terrestrial literature as 
body-size distributions (Brown 1999). The size spec¬ 
trum refers to the smooth relationship of aggre¬ 
gate numbers, biomass, or diversity to size interval 
across the sampled size range of the commu¬ 
nity (i.e. how many individuals, how much bio¬ 
mass, or what is the diversity of the different size 
classes of organisms that make up the commu¬ 
nity). Size spectra have been shown to vary with 
habitat changes (White et al. 2004) and fishing 
pressure (Bianchi et al. 2000). This is likely due 
to the greater susceptibility of large-bodied indi¬ 
viduals to fishing pressure and habitat modifica¬ 
tion (Gislason & Rice 1998). These changes can be 
used as an indicator of human disturbance that 
can be calculated over time for a given system or 
spatially across communities. Like trait diversity, 
size spectra focus on community trait distributions 
rather than taking a strictly taxonomic approach 
to measuring biodiversity. Given the links between 
body size and metabolism, changes in size spectra 
have significant implications for ecosystem fluxes 
and thus potentially ecosystem services. We there¬ 
fore feel this is a promising approach to detect¬ 
ing community changes with important ecological 
consequences. 

A potentially efficient way to detect changes in 
the shape of SADs is to compare parameter esti¬ 
mates or the goodness of fit of models to these 
types of data. However, with the exception of the 
log-series cx, parametric measures have rarely been 
applied to detect the effects of disturbances. This 
may be due, in part, to the computational difficul¬ 
ties previously associated with estimating parame¬ 
ters. Given the widespread availability of appropri¬ 
ate software (Table 17.1), this is no longer a problem. 
With the main obstacle to their use removed, the 
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study of parametric diversity indices represents a 
promising, relatively unexplored area of research. 
Fig. 17.2a shows a RAD plot of some models that 
have been used to detect the effects of disturbances. 
There are over 30 models currently available that 
can be used for this purpose, and there is no consen¬ 
sus as to which models are best (McGill et al. 2007). 
We will focus only on some of the models that have 
been used in this context and appear to be effective: 
the log-series, log-normal and gambin models. We 
note that, although model parameters may be able 
to detect the effects of disturbances regardless of 
the goodness of fit of the model to the data (Kemp- 
ton & Taylor 1976) (i.e. even if a model does not 
fit the data very well, its parameter estimates may 
reflect changes in the distribution), following best 
practices in parameter estimation and goodness- 
of-fit testing (Chapter 10) is likely to enhance any 
method's efficiency. 

Log-series cc, also known as Fisher's a in honour 
of its developer (Fisher et al. 1943), is a measure of 
diversity that has occasionally been used in applied 
studies (see, for example, Fiedler & Schulze (2004), 
Wolf (2005), and Hilt et al. (2006)). It is a parametric 
measure calculated in order to fit the log-series dis¬ 
tribution. However, regardless of whether a given 
community fits the log-series distribution well or 
not, log-series a can be used as a measure of diver¬ 
sity (Kempton & Taylor 1974, 1976; Taylor 1978). 
Like all measures of diversity, it confounds species 
richness and evenness to some extent. Nevertheless, 
unlike the diversity measures mentioned in the pre¬ 
vious section, it is relatively unaffected by sample 
size and confidence intervals can be calculated eas¬ 
ily (Anscombe 1950). 

The log-normal distribution is probably the most 
commonly used SAD model, and deviations from 
this model have been suggested to be indicative 
of disturbance (Kempton & Taylor 1974; Ugland 
& Gray 1982; Gray 1987). For example, Kevan 
et al. (1997) fitted log-normal curves to pollina¬ 
tor abundance data in areas with and without 
insecticide application. They found that the undis¬ 
turbed sites had an SAD that resembled a log¬ 
normal distribution, while the disturbed sites did 
not. Ugland & Gray (1982) argue that the log¬ 
normal is a good descriptor of undisturbed habi¬ 
tats, but under disturbance the single log-normal 
breaks down into several independent curves. In 


fact, there is evidence of changes in species abun¬ 
dance due to human disturbance resulting in mul¬ 
timodal SADs (Ugland & Gray 1982; Dornelas et al. 
2009). It has, therefore, been suggested that the log¬ 
normal model provides an objective standard by 
which ecosystem health can be evaluated, in con¬ 
trast with most metrics, which require a reference 
or control for comparison with the disturbed com¬ 
munity (Gray 1981, 1983). However, there is sub¬ 
stantial controversy surrounding the log-normal 
as a SAD model (e.g. McGill 2003b; Volkov et al. 
2003; Williamson & Gaston 2005). Nevertheless, we 
believe this to be a promising approach that merits 
both more empirical tests and theoretical explana¬ 
tion. A way to quantify these effects is to fit differ¬ 
ent parametric distributions to SADs. For example. 
Gray (1987) points out that the geometric series of 
Motomura fits communities that are poor in species, 
such as those under extreme pollution, while the 
log-series fits moderately disturbed communities 
and the log-normal best describes SADs for undis¬ 
turbed sites. However, using different models to 
measure a gradient in human disturbance is dif¬ 
ficult to justify theoretically and presents practi¬ 
cal challenges regarding the selection of the most 
appropriate model for each SAD. 

An alternative is to use a flexible distribution that 
can accommodate variability in shape within a sin¬ 
gle framework. An example is the gambin model 
(named for the two distributions, y and binomial, 
used to fit the model) recently proposed by Ugland 
et al. (2007). Despite its compound nature, the 
model has only a single parameter, a, for which the 
authors provide a biological interpretation. Specifi¬ 
cally, Ugland et al. (2007) suggest that oc is related to 
niche space and the number of processes structur¬ 
ing ecological assemblages. Lower values indicate 
few structuring processes, as might be expected in 
stressed or disturbed systems. Thus, Ugland et al. 
(2007) conclude that gambin a might be useful for 
the assessment of ecosystem health (see the exam¬ 
ple in Box 17.2). 

The Q statistic (Kempton & Taylor 1976) is based 
on the cumulative species abundance curve (not to 
be confused with the eCDF). It can be calculated by 
plotting the cumulative abundance curve and mea¬ 
suring the interquartile slope (see Fig. 17.2d). The 
fact that the most common and rarest species are 
excluded from the calculation of Q means it is less 
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Box 17.2 Using changes in the species abundance distributions to measure the effects 
of disturbance 


Graham et al. (2004) examined the effects of habitat 
disturbance on the abundance of ants in the southeastern 
USA. Certain parts of the study area were heavily disturbed 
by mechanized infantry training. Tanks and other vehicles 
had denuded large areas of most of their vegetation. 
However, relatively undisturbed forest was found adjacent 
to the training areas. It was therefore possible to study ant 
communities across a disturbance gradient, and the area 
was divided into three classes: low, medium, and high 
disturbance. Lightly disturbed areas had minimal 
disturbance to soils, extensive woody vegetation and 
canopy cover. Moderately disturbed areas had an intact 
canopy but active erosion was often observed. Gullies were 
present in these areas and there was evidence of surface 
soil loss. Heavily disturbed areas had greatly reduced plant 
cover. There were large patches of bare ground and many 
signs of active erosion. Ant communities were sampled at 
each level of disturbance. 

SADs are often presented as histograms with log-2 
abundance classes (octaves), known as Preston plots. 
Binning the ant abundance data into octaves containing 1, 


variable to changes in dominance or the number 
of rare species, which are much affected by sam¬ 
pling effort. For this same reason, however, the Q 
statistic is a relatively insensitive metric of biodiver¬ 
sity change. Importantly, the Q statistic can be cal¬ 
culated using parametric SAD models such as the 
log-series, log-normal, or y distributions. For exam¬ 
ple, Q equals the log-series diversity parameter oc 
(Kempton & Taylor 1976). Although not strictly a 
parametric metric, we mention the Q statistic here 
because calculating it using the fitted models yields 
more precise estimates than doing it directly from 
the data. 

Finally, we want to emphasize that SADs often 
provide powerful illustrations of the effects of dis¬ 
turbances on community structure. SAD plots can 
be visually compared among various communities 
in order to detect changes brought about by human 
disturbance. Showing dramatic changes in com¬ 
munity structure visually (see Fig. 17.3) is a useful 
addition to any quantitative analysis. 


2-3, 4-7, 8-15, etc. individuals/species yields the three 
sets of histograms shown in Fig. 17.3a—17.3c. The effects 
of this type of disturbance are immediately apparent, with 
community structure showing a classical response: an initial 
log-normal-like species abundance curve gradually changes 
into a log-series-like curve. These changes can be 
quantified by fitting the gambin model to the data 
(Fig. 17.3; Ugland et al. 2007). The model is fitted by 
maximum likelihood estimation of the parameter a. 
Differences among disturbance regimes are clearly reflected 
in the gambin a parameter, which takes the values 
a Low = 5.0, a Mc dium = 3.2 and a Hig t, = 1.5. In the lightly 
disturbed areas the number of processes structuring the ant 
community is large and therefore the species abundance 
curve appears log-normal. At high levels of disturbance the 
a parameter drops down to 1.5 because the main process 
structuring the community under these conditions is 
'tolerance' to sandy eroded soils. Only a few species 
maintain a large population size while most of the species 
abundances drop dramatically, with a substantial fraction 
disappearing from the habitat. 


17.3.3 Multivariate analysis 

So far, with the exception of phylogenetic and trait 
diversity, we have discussed metrics that are inde¬ 
pendent of species identities. However, because 
species have different susceptibilities, community 
composition is usually modified by disturbance. 
That is, along a disturbance gradient robust species 
usually replace species that are more sensitive 
to that particular disturbance. Therefore, tracking 
communities through multivariate space, where 
species abundances are the multiple variables, 
allows the detection of subtle effects of disturbances 
on community composition. Several techniques can 
be used to this effect, and we will give a brief 
overview of some of the options available. We rec¬ 
ommend consulting more comprehensive descrip¬ 
tions of multivariate analysis (e.g. Legendre & 
Legendre 1998; Quinn & Keough 2002; Leps & Smi- 
lauer 2003) to readers unfamiliar with this type of 
approach. 
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Figure 17.3 Species abundance distribution curves and the Gambin model for pitfall samples of ants in the southeastern USA (Graham et al. 2004) for 
three regimes of habitat destruction (the significance probability (P) is based on 1000 simulations see Ugland et al. (2007)). (a) low, P = 0.32; (b) medium, 
P = 0.46; (c) high, P = 0.13; (d) comparison of the three fitted curves: low, solid line; medium, dashed line; high, point dashed line. 


Multivariate analysis aims to reduce the num¬ 
ber of dimensions in data while preserving as 
much information as possible. We will mention 
two approaches, often known as R-mode and Q- 
mode analysis (Quinn & Keough 2002). In the con¬ 
text of community composition, R-mode analysis 
rearranges the data based on associations between 
species; whereas Q-mode analysis is based on simi¬ 
larities between sampling units. That is, in R-mode 


analysis the structure in the covariances (or cor¬ 
relations) among species is analysed to extract 
new synthetic variables that concentrate the orig¬ 
inal variance in fewer variables. Some commonly 
used methods that follow the R-mode approach 
are principal component analysis (PCA), redun¬ 
dancy analysis (RDA), and correspondence analy¬ 
sis (CA). In Q-mode analysis, a matrix of pair-wise 
similarities among all sampling units is calculated 
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Box 17.3 Detecting the effects of disturbances on community composition 


Magurran and Phillip (2001) collected an extensive dataset 
of freshwater fish communities from Trinidad and Tobago. 
From these data, we selected seven pairs of sites in similar 
freshwater streams, which included an undisturbed site and 
a heavily polluted site. At each site Magurran and Phillip 
(2001) took a standardized sample and identified all fish 
caught to species. Here we investigate if disturbed 
communities (i.e, communities from polluted sites) differ in 
composition from undisturbed communities. 

We use non-metric multidimensional scaling available in 
the R-package vegan (Oksanen et al. 2008; a good source 
of more detailed advice on these methods can be found in 
the vegan tutorial). We begin by transforming the species 
x site matrix using a Wisconsin double standardization. 
This standardization consists of dividing each species by its 
maximum and each site by its total. It often improves the 
results because it prevents the dominant species from 
overwhelming distance measures, as well as preventing 
differences in community size from confounding the results. 
Secondly we calculate a site x site Bray-Curtis distance 
matrix. Finally, we re-scale ranked distances onto new 
dimensions using Kruskal's non-metric multidimensional 
scaling (Kruskal 1964). This ordination method finds the 


coordinates of each site on synthetic dimensions, so that 
the order of distances among sites is as close as possible to 
the original matrix. The success of the ordination can be 
checked by looking at the R 2 of a regression between 
observed and ordination distances (which measures the 
amount of variance explained by the ordination). A linear fit 
between the Bray-Curtis distances we calculated and the 
ordination distances has an R 2 of 0,953 and a non-metric 
fit has an R 2 of 0.991. Hence, we can be confident that the 
ordination has not distorted the observed distances 
excessively, 

A plot of the results shows that the first dimension 
separates most disturbed and undisturbed sites (Fig. 17.4), 
but there are two polluted sites and two clean sites that are 
in between the two clusters. The second dimension 
separates these four sites, so that clean sites cluster in the 
bottom right corner and polluted sites in the upper left 
corner. Performing an analysis of similarity would allow 
testing of whether the effects of disturbance on the species 
composition of these freshwater fish communities are 
statistically significant in a classical hypothesis testing 
fashion. 



PCoAl 


Figure 17.4 NMDS ordination plot of polluted (black circles) and unpolluted (white circles) sites from freshwater fish communities in Trinidad and 
Tobago (data from Magurran and Phillip (2001)). 


using species abundances. The sampling units are 
then re-scaled onto new dimensions that minimize 
the distortion of the original distances. The meth¬ 
ods that follow Q-mode analysis most commonly 
used in community ecology are principal coordi¬ 
nate analysis (PCoA) and multidimensional scal¬ 
ing (MDS) Box 17.3. 


In Q-mode analysis the choice of similarity mea¬ 
sure is crucial, and there is a vast array of measures 
available (see, for example, Legendre & Legendre 
(1998)) for detailed descriptions). Two commonly 
used measures in this context based on presence- 
absence data are the Sorensen and Jaccard dis¬ 
tances, which differ only on the weight given to 
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shared vs unshared species. Modifications of these 
distances to be used with abundance data are 
known as Bray-Curtis (Bray & Curtis 1957) (also 
known as Steinhaus similarity) and Chao distances 
(Chao et al. 2005), respectively. It is important to 
note that using presence-absence data gives equal 
weight to all species in the communities, regardless 
of whether they are represented by a single indi¬ 
vidual or thousands. Since the majority of species 
are rare in most communities, using presence- 
absence based distances emphasizes the importance 
of rare species. In contrast, since the abundance of 
dominant species is typically orders of magnitude 
greater than that of the rarest species, abundance- 
based similarities tend to be mostly driven by the 
most abundant species. Anderson et al. (2006) pro¬ 
pose a flexible similarity measure that can be used 
to tune the weight of abundances. These authors 
also recommend running analyses with both com¬ 
positional (i.e. presence-absence based) and abun¬ 
dance changes to distinguish the effects of each 
facet on the patterns. 

Both R- and Q-mode approaches can be used to 
compare communities with different levels of dis¬ 
turbance, and in some cases they allow different 
questions to be addressed (see Legendre et al. (2008) 
for an example). However, R-mode methods, and 
to some extent PCoA, are particularly affected by 
the 'arch effect', where linear gradients are repre¬ 
sented as a curve with extremes scaled as close 
together (Minchin 1987). This is a consequence of 
the joint absence of species, and there is debate 
about whether it is an artefact of the scaling tech¬ 
niques or a real pattern (Wartenberg et al. 1987). 
From a practical point of view, the arch effect is 
problematic because it can make gradients impos¬ 
sible to recognize without prior knowledge of site 
order. Simulation studies have shown that MDS, 
and in particular non-metric MDS, are the methods 
most robust to this effect (Minchin 1987). 

Multivariate analysis is extremely versatile and 
can be used to detect effects of disturbances on 
community composition in several ways. Commu¬ 
nities can be tracked through time as they undergo 
a disturbance and through the recovery trajectory 
(Warwick & Clarke 1993). We can analyse sampling 
units distributed across a disturbance gradient to 
test whether community composition and species 
abundances reflect the level of disturbance (Hewitt 


et al. 2005). The effects of disturbances can also 
be quantified by a community disturbance index 
(CDI), which compares disturbed sites to previ¬ 
ously scaled boundaries of undisturbed commu¬ 
nities (Flaten et al. 2007). Loss of 3 diversity (i.e. 
homogenization of the landscape) can be detected 
by comparing the dispersion of sampling units in 
multivariate space (Anderson et al. 2006). 

The application of multivariate analysis to detect 
the effects of disturbances on biodiversity is poten¬ 
tially very powerful. This approach can detect 
effects not reflected by metrics, like species richness 
or diversity, because it takes into account the iden¬ 
tity of species (Hewitt et al. 2005). However, for this 
same reason multivariate approaches are highly site 
specific. Thus, biogeographic influence and habitat 
specificity can hide the effects of disturbances when 
large or isolated regions are included in the same 
analysis. Consequently, multivariate approaches 
are not suitable for generalizations about the effects 
of a disturbance at global scales (although some 
approaches have been proposed (Warwick & Clarke 
1993)) or across habitats. Additionally, multivariate 
analysis is conceptually complex and sometimes 
difficult to interpret. Consequently, a good under¬ 
standing of these techniques is a fundamental pre¬ 
requisite to successfully using this approach. 

17.4 Prospectus 

There is a vast array of methods available to mea¬ 
sure the effects of disturbance on biodiversity pat¬ 
terns. In this review we grouped metrics into three 
categories: univariate indices condense all informa¬ 
tion about each community into a single number, 
SAD-based metrics consider distributions of abun¬ 
dance or traits in the communities, and multivari¬ 
ate approaches retain information regarding species 
identities. In principle, the ability to detect distur¬ 
bances should increase along these groups. How¬ 
ever, there are trade-offs between metric sensitivity 
and both the generality and the interpretability of 
the results, therefore the best metric is not necessar¬ 
ily the most sensitive. 

In fact, there is no single best metric to detect 
the effects of disturbances on biodiversity. This 
is because both disturbance and biodiversity are 
complex concepts that include multiple processes 
and patterns. The best metric depends both on 
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the type of disturbance (see Introduction) and the 
type of effects we are interested in detecting (see 
below). Nevertheless, from the vast array of metrics 
available, some perform better than others and 
some are more suited to certain types disturbances, 
available data, or to detect specific effects. Table 17.1 
summarizes the metrics reviewed in this chapter 
and highlights some of their advantages and disad¬ 
vantages. Although some studies compare the rel¬ 
ative performance of different biodiversity metrics 
(e.g. Abella et al. 2006; Patricio et al. 2006; Salas et al. 
2006a,b; Ware et al. 2008), further synthetic research 
is needed so that best approach standards emerge. 

One conclusion that emerges from this review is 
the importance of explicitly identifying what type 
of effects a study is aiming to detect. The aim of 
detecting any effects of a disturbance is too vague 
to allow the selection of an appropriate method. 
Specifically, we may be interested in measuring 
whether the total number of species in a community 
is affected by a disturbance, in which case species 
richness is the obvious metric. If we are particularly 
concerned with the fate of rare species, because 
they are of special conservation interest for exam¬ 
ple, we should probably use a SAD-based metric. 
Some studies will focus on particular or dominant 
species to assess the risk posed by pests or species 
invasions, and need to use metrics that include 
composition information. We may also be interested 
in how biodiversity relates to ecosystem services, 
and need to consider metrics that include species 
traits. Once the type of effect we are trying to detect 
is identified, the choice of biodiversity metric fol¬ 
lows. Moreover, using a range of different metrics 
to capture different aspects of diversity will often be 
important. Thanks to the development of comput¬ 
ing abilities and software, computational difficul¬ 
ties are no longer a constraint. We note that promot¬ 
ing the use of novel methods will hinge on the avail¬ 
ability of the software needed to compute them. 

We want to highlight a few metrics that in 
our opinion are particularly promising. Firstly, 
trait-based metrics allow the detection of changes 
that may be subtle at a taxonomic level, but have 
crucial implications for ecosystem functioning and 
services. Secondly, although SAD-based metrics 
have been around since the 1970s, we feel their 
full potential is yet to be explored and encourage 
their use in the context of detecting the effects of 


disturbances. The eCDF in particular, because of its 
simplicity and independence from any parametric 
model, is a promising approach. Thirdly, 
multivariate metrics are extremely versatile and 
ideal to target questions where species identities 
are important. Non-metric MDS (NMDS) seems 
to be the most robust method for the type of 
data typical in this field. Finally, we believe it is 
important to report standard metrics, and species 
richness in particular, for comparative purposes. 

On a final note we call for theoretical develop¬ 
ment in this area. Ultimately, rather them measure 
destruction we aim to predict the effects of distur¬ 
bances before they occur, in order to manage nat¬ 
ural resources in the most effective way. For that, we 
need to develop theory for how aspects of diversity 
other than species richness are affected by distur¬ 
bances. We feel that much more needs to be done 
on this front. 

17.5 Key points 

1. The biodiversity metric most appropriate to 
quantify the effects of disturbance depends on 
the nature of the disturbance, the type of effect, 
and the data available. Often the use of multiple 
metrics is necessary for a good understanding of 
the consequences of disturbances. 

2. Species richness and evenness are two standard 
univariate metrics that complement each other. 

3. Trait diversity is an important metric because 
it allows the consequences of disturbances to 
ecosystem services to be explored. 

4. From the SAD-based metrics, the eCDF is a 
promising new approach. It is mathematically 
familiar, simple to interpret, and easy to use. 

5. NMDS seems to be the multivariate approach 
most appropriate for the type of data typically 
available in disturbance studies. 

6. More theoretical work is needed to understand 
the effects of disturbances on biodiversity met¬ 
rics other than species richness. 
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CHAPTER 18 


Measuring biodiversity 
in managed landscapes 

Steven L. Chown and Melodie A. McGeoch 


18.1 Introduction 

One thing that is clear is that the majority of land¬ 
scapes, in a physical sense, are the result of interac¬ 
tion between man and nature, and of man's use and 
misuse of nature. 

Gunnarsson 2006 
A Mirror of Nature 
Nordic Landscape Painting 1840-1910 

If one wishes to explain the distribution and num¬ 
bers of a certain species of animal, here is one way of 
going about the job which we have found especially 
profitable. It is best to describe the method as if it 
were done in three stages, although, in practice, it is 
usually best to have the three stages going forward 
at the same time. 

a) The physiology and behaviour of the animal 
must be investigated... 

b) The physiography, climate, soil and vegetation 
in the area must be investigated... 

c) The numbers of individuals in the population 
that has been selected for study must be mea¬ 
sured as accurately as practicable. 

Andrewartha & Birch 1954 
The Distribution and Abundance of Animals 

Over the past century the human population has 
grown from an estimated 1.66 billion to 6.79 billion. 
It is projected to reach c. 9 billion by 2050 with 
a current growth rate of, on average, 1.17% p.a. 
(Anonymous 1999; CIA Fact Book 2009). Despite 
considerable spatial and temporal fluctuations in 
birth and death rates, often correlated with chang¬ 
ing climates (Zhang et al. 2007), the human popu¬ 
lation has seen increasing growth since the black 


death that ravaged much of Eurasia in the 14th 
century (Cohen 1995; Anonymous 1999; Marriott 
2002). This growth has been supported by and 
stimulated ongoing changes in the ways in which 
humans interact with the world, including the agri¬ 
cultural and industrial revolutions (Harrison 1992; 
Cohen 1995). These changes have led to modifica¬ 
tion of a substantial proportion of the planet's sur¬ 
face, including its atmosphere, and freshwater and 
marine systems. Indeed, somewhere between one- 
third and a half of the land surface has been trans¬ 
formed by human activities, and marine systems 
have seen similar, although less visible, changes 
(Vitousek et al. 1997; Watling & Norse 1998; Jack- 
son et al. 2001; Klein Goldewijk 2001; Milesi et al. 
2005; Ellis & Ramankutty 2008). Even those areas 
that might at first appear largely untransformed 
are being affected indirectly, either by human influ¬ 
ences on the global climate (Parmesan 2006; le Roux 
& McGeoch 2008; Chen et al. 2009) or through the 
introduction of self-sustaining and often increasing 
populations of invasive alien species (Foxcroft et al. 
2004; Dawson et al. 2009). Moreover, many of these 
apparently untransformed areas have long histories 
of human influence (e.g. Mabunda et al. 2003) and 
are currently subject to some form of management. 
The latter is certainly true of all but the globe's 
largest protected areas. Even there, management 
policies that in several instances involve direct man¬ 
agement (e.g. fire and herbivore management) have 
typically been formulated (DuToit 2003; Govender 
et al. 2006). In consequence, when biodiversity is 
measured, more often than not it is measured in or 
across landscapes under some form of management 
or exposure to human-induced transformation (see 
Gaston (2006) for further discussion). 
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A major consequence of this preponderance of 
managed and modified landscapes is that much 
of what is known about how and why biodi¬ 
versity varies through space and time, and how 
this variation should be quantified, refers either 
directly or indirectly to such landscapes. Indeed, 
it might be argued that for many purposes, the 
kinds of methods developed for measuring bio¬ 
diversity more generally (as reviewed throughout 
this book and elsewhere, e.g. Blackburn and Gas¬ 
ton (1998); Epperson (2003); Magurran (2004); and 
Gaston et al. (2009)) are as much applicable to mod¬ 
ified and managed landscapes as they are to all 
others. No particularly special set of methods is 
required. However, as with all such work, the form, 
structure, grain, and varying age of the elements 
of the landscape should be considered carefully, as 
should the scale or scales at which the study should 
be undertaken and the sampling methods required 
to address the issues at hand (see Chapters 2-15). 
When biodiversity is measured in what is more typ¬ 
ically considered a managed landscape (such as a 
set of fields with hedgerows or wheat fields with 
small remnant patches of Renosterveld (Samways 
1989; Donaldson et al. 2002)) the goals often con¬ 
cern questions about the management of particular 
features of the landscape, be they individuals, pop¬ 
ulations, metacommunities, or system functioning. 
In a conservation context, the goals of measurement 
are often how to manage to reduce levels of trans¬ 
formation or the impact thereof. 

From this perspective, modification of the land¬ 
scape and management thereof can be considered 
orthogonal axes, along which different regimes 
can clearly be set out and for which the over¬ 
all goals of biodiversity measurement may differ 
substantially (Fig. 18.1). Therefore, in this chap¬ 
ter, rather than specifically reviewing the meth¬ 
ods that might be used for measuring biodiversity, 
which are dealt with elsewhere in the book, we 
shift the focus to: (i) the ways in which the goals 
of biodiversity measurement vary with the extent 
of modification and management of landscapes, 
which is partly also a question of resolution and 
extent of the investigation, (ii) specific examples of 
approaches that have been successful in addressing 
these goals, and (iii) the benefits of adopting r x c 
matrices for integrating biodiversity measures and 


understanding across several levels of the biological 
hierarchy. 

18.2 State of the field 

18.2.1 Variation in biodiversity measurement 
goals 

Across the range of variation in the extent of land¬ 
scape modification and management (Fig. 18.1), the 
goals of biodiversity measurement typically dif¬ 
fer substantially. Moreover, the spatial scale (extent 
and resolution) of the studies tends to covary 
along the axes of modification and management. 
For example, investigations of the patterns in and 
mechanisms underlying large-scale variation in 
biodiversity (usually richness, but also turnover, 
abundance, range size, and body size, e.g. Chown 
et al. (1998); Pautasso & Gaston (2005); Buckley 
& Jetz (2008); Dunn et al. (2009); and Olson et al. 
(2009)) usually assume that the signal of human 
landscape transformation has not markedly influ¬ 
enced the patterns detected at large spatial extents 
and resolutions, and the mechanisms thought to 
underlie them (see discussion in Gaston (2006)). 
More often the concerns lie with the influence of 
different kinds of data (e.g. range maps vs museum 
specimen records, see Hurlbert & White (2005), Gra¬ 
ham & Hijmans (2006), Rondinini et al. (2006), and 
McPherson & Jetz (2007)), and spatial extent and 
resolution (Rahbek & Graves 2001; van Rensburg 
et al. 2002; Hurlbert & Jetz 2007; Mora et al. 2008) on 
the outcomes of the work, and on how the measure¬ 
ment approach varies with extent and resolution 
(e.g. the census area effect; Gaston et al. (1999)). 

At global and continental scales the assumption 
that the non-human signal still predominates seems 
justified (although poorly explored, but see Ellis 
& Ramankutty (2008) for a different perspective), 
but at somewhat smaller scales, such as for partic¬ 
ular countries or sub-regions, landscape transfor¬ 
mation certainly does appear to affect patterns of 
richness, composition, abundance, and their inter¬ 
actions (e.g. Fisher & Frank 2004; Evans et al. 2006a; 
Gaston 2006; Leprieur et al. 2008). Because the goals 
of sub-regional or country studies typically are to 
investigate the form of macro-scale variation and 
the mechanisms underlying it (including variation 
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Figure 18.1 The relationship between landscape management intensity, transformation, land-use type, and (right axis) varying objectives for biodiversity 
measurement. 
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in single species, e.g. Moodley & Bruford (2007) 
and Evans et al. (2009)), understanding covariation 
between human activities, population density, and 
measures of biodiversity must form a component 
thereof (Chown et al. 2003; Gaston 2006). In this 
context it is important to recognize that human 
population density and species richness are often 
positively related, largely because both respond 
similarly to environmental drivers such as pro¬ 
ductive energy availability and elevation (Chown 
et al. 2003; Milesi et al. 2005; Luck 2007). Moreover, 
these relationships are likely to change markedly 
as productivity continues to increase in a spatially 
variable manner (Nemani et al. 2003), as human 
population size continues to increase more in areas 
where net primary productivity is water limited 
(Milesi et al. 2005), and, in many countries, as 
urbanization continues to exert an increasing influ¬ 
ence on the landscape (Latimer et al. 2004; McK¬ 
inney 2006; Evans et al. 2006b; Tratalos et al. 2007; 
McDonald et al. 2008). 

As the levels of landscape transformation and 
management increase, so the variety of research 
questions posed, and approaches adopted, tends 


to become larger, and often also more context spe¬ 
cific. For example, at the scale of whole, very 
large protected areas (those above 1 million ha) 
or catchments (scales where management intensi¬ 
ties might be considered intermediate (Fig. 18.1)), 
the questions may vary from those concerned with 
variation in the richness of indigenous and/or 
invasive species (McKinney 2002; Stohlgren et al. 
2006; Foxcroft et al. 2009), to those dealing with 
population variation and the environmental and 
management influences thereon (Ogutu & Owen- 
Smith 2003; Owen-Smith et al. 2006). Alternatively, 
questions may concern the influence of particular 
management regimes on assemblage structure and 
composition, such as the impact of fire regimes 
on biodiversity in savanna parks (Parr et al. 2004; 
Govender et al. 2006). In addition to the usual 
recommendations for biodiversity measurement in 
such studies, much more context-dependent mea¬ 
surement problems arise. For example, despite the 
significance of understanding the impacts of dif¬ 
ferent burning regimes on biodiversity in fire- 
prone systems in southern Africa (e.g. Govender 
et al. 2006), much of the work on animals has 
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been characterized by inadequate replication, lack 
of attention to fire duration, season, and time of 
day, poor reporting of spatial extent and sam¬ 
ple unit size, and poor representation of inverte¬ 
brates, amphibians, and reptiles in these investi¬ 
gations (Parr & Chown 2003). Often the need to 
address these kinds of problems extends to other 
geographic areas and represents a more general 
recognition of the value of increased experimental 
rigour or new approaches. In some instances, the 
concerns lie with the application of extant and read¬ 
ily accessible theory and techniques (e.g. the take 
up of information theoretic approaches to model 
selection; review in Johnson & Omland (2004)). In 
others, relatively recent theoretical and technolog¬ 
ical developments are rapidly extending the scope 
of current understanding. For example, develop¬ 
ments in theory and computer technology that have 
enabled spatially explicit approaches to be adopted 
are bringing about major changes in such areas as 
population viability assessment and the assessment 
of how populations are likely to respond to changes 
in landscape variation over time (Porter et al. 2002; 
Akcakaya et al. 2004; Carroll 2006; McRae et al. 
2008). In essence, these developments form a com¬ 
ponent of the r x c matrix approach that will be 
discussed below. 

At the greatest levels of modification and man¬ 
agement (Fig. 18.1), especially in intensive agricul¬ 
ture or small, intensively managed reserves, the 
nature of biodiversity measurement is often sim¬ 
ilar, albeit focused simply on either increasing or 
otherwise manipulating diversity and population 
sizes. For example, the assessment of population 
dynamics may be aimed at either controlling a prob¬ 
lem species (population reduction) or conserving 
a rare one (population maintenance or increase). 
Thus, invasive species may be the subject of con¬ 
trol in conservation areas, whereas the conservation 
of functionally significant species may be the goal 
in an agricultural context. Whether these opposing 
goals are being realized in a protected area or inten¬ 
sively farmed landscape, the methods adopted are 
often similar and incorporate work across almost 
the full range of the biological hierarchy, from the 
level of genes to the level of landscapes and ecosys¬ 
tems. The former may involve the assessment of 
the risks of introgression from genetically modified 


organisms to crop or to wild species (e.g. Zapiola 
et al. 2008; Pineyro-Nelson et al. 2009; McGeoch 
et al. 2009), whilst the latter may concern the effects 
of the spatial arrangement of patches and corri¬ 
dors on the diversity of assemblages (Damschen 
et al. 2006), the ways different ecosystems interact 
to affect local diversity (Knight et al. 2005), or the 
most appropriate management actions for restoring 
diversity at the landscape level (e.g. Rand et al. 
2006; Richardson et al. 2007). By its very nature, 
much of this measurement concerns the use of 
subsets of biodiversity that are representative of 
broader responses, and the use of these subsets 
to monitor changes in environmental state or the 
status of biodiversity per se. 

18.2.2 Bioindicators and monitoring 

Bioindication and monitoring have long been fea¬ 
tures of pest management, ecotoxicology, and con¬ 
servation (van Straalen & Verhoef 1997; McGeoch 
et al. 2002; Bockstaller & Girardin 2003). Bioindi¬ 
cators that readily reflect and represent the state 
of the environment are considered an effective and 
necessary approach to gathering sufficient informa¬ 
tion whilst minimizing the resources required to do 
so (McGeoch 1998). In conservation, they are con¬ 
sidered especially appropriate for addressing the 
challenges of understanding and mitigating human 
influences on biodiversity. Three main categories 
of bioindication can be recognized, each of which 
has its own goals (Table 18.1). Nonetheless, they 
overlap to some extent and share several method¬ 
ological principles. Key amongst these is the pre¬ 
dictability of the relationship between the bioindi¬ 
cator and the environmental variable of interest 
(abiotic or biotic, i.e. in biodiversity indication this 
could be the relationship between the abundance 
of a single species and the diversity of an assem¬ 
blage, e.g. Fleishman et al. (2005)). Thus the indica¬ 
tor and variable of interest should (i) have a signifi¬ 
cant and strong relationship (statistical significance 
should preferably be accompanied by large effect 
size) and (ii) be characterized by a robust relation¬ 
ship, i.e. the relationship should remain constant 
within the spatial and temporal context of inference, 
or should be consistent and repeatable (McGeoch 
2007; Table 18.2). The value of the indicator will 
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Table 18.1 The three categories of bioindication, distinguished by their distinct objectives, and their application in measuring biodiversity in managed 
and transformed landscapes. 

Category 

Environmental indicator 

A species or group of species that responds predictably, in ways that are readily observed and quantified, to often abiotic environmental 
disturbance or to a change in environmental state. 

Related terms: Sentinel, exploiter, bioassay, accumulator, biomarker. 

Example The community structure of soil arthropods is indicative of soil properties (e.g. pH) and the impact of human activities (van Straalen 1998). 

Ecological indicator 

A species or group of species that demonstrate the effects of environmental change (such as habitat alteration and fragmentation and climate 
change) on biota or biotic systems. 

Example The abundance and richness of trap-nesting bees are indicative of habitat quality and ecological change in European agricultural landscapes 
(Tscharntke et al. 1998). 

Biodiversity indicator 

A biodiversity indicator is a group oftaxa (e.g. genus, tribe, family, or order, or a selected group of species from a range of higher taxa) or 
functional group, the diversity of which reflects some measure of the diversity (e.g. character richness, species richness, level of endemism) of 
other higher taxa in a habitat or set of habitats. 

Related terms: Surrogate, umbrella, flagship, focal species or taxon. 

Example Occurrence patterns of selected butterfly species are indicative of substantial, significant variation in combined bird and butterfly species 
richness in the Central Great Basin (Fleishman et al. 2005). 


increase if the additional criteria of representivity 
and generality are met (Fig. 18.2). If the bioindica¬ 
tor, or bioindication system (van Straalen & Verhoef 
1997), does not comply with these criteria, includ¬ 
ing clear objectives, ease of sampling, cost-efficacy, 
and taxonomic knowledge (McGeoch 1998, Law- 
ton & Gaston 2001; Chapter 2), then, irrespective 
of the methodological correctness of the sampling 
and/or measurement approaches, it is unlikely to 
prove useful. Much biodiversity measurement in 
the context of bioindication is aimed at quantifying 
and testing these ideas. 

The value of bioindicators ultimately lies in 
their adoption and application in monitoring pro¬ 
grammes, e.g. for monitoring the success of restora¬ 
tion efforts (Albrecht et al. 2007), the effect of river 
flow management on fish diversity (Taylor et al. 
2008), or the effectiveness of protected areas in con¬ 
serving biodiversity (Jackson & Gaston 2008). The 
functions of monitoring programmes are nonethe¬ 
less clearly closely allied with- the functions of 
bioindicators (McGeoch 1998, 2007). Importantly, 
where monitoring systems are established to detect 
the influence of change on any particular level 
of the biological hierarchy, thresholds and their 


identification become particularly significant (Biggs 
et al. 2009). Here it is important to distinguish 
between ecological thresholds (values of system 
state variables) and decision thresholds (thresholds 
at which management actions are taken and which 
may be determined by societal values rather than 
ecological thresholds per se) (Martin et al. 2009). 

Much attention has rightly been given to the 
establishment of monitoring programmes for bio¬ 
diversity management in both control and conser¬ 
vation settings (Mace & Baillie 2007; Regan et al. 
2008; Foxcroft et al. 2009). However, some of the 
most useful detections of long-term change in bio¬ 
diversity have emerged from biodiversity measure¬ 
ment activities that initially did not have these goals 
in mind. For example, much useful information 
has arisen from recent repetitions of surveys pre¬ 
viously conducted 20-40 years ago, especially on 
long-term, climate change-related alterations in pat¬ 
terns of diversity, from genetic to assemblage levels. 
Insights from such work include shifts in the latitu¬ 
dinal dine in the alcohol dehydrogenase polymor¬ 
phism in Drosophila melanogaster (Umina et al. 2005), 
changes in body size patterns in one or more popu¬ 
lations (Gilchrist et al. 2004), declines in abundance 
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Table 18.2 Procedural steps in the development of a bioindication system for use in managed and transformed landscapes (adapted from McGeoch 
1998). 


Step 

Action 

Explanation 

1 

Determine broad objective as environmental, ecological or biodiversity 
indication 

See Table 18.1 

2 

Refine objective and clarify endpoint 

Place broad objective into specific context of interest and define 
context-specific objectives, e.g. to detect changes in environmental 
quality as a result of changing pesticide usage or to detect the 
impacts of increasing habitat fragmentation on ecosystem 
functioning 

3 

Select potential indicator based on accepted a priori selection criteria, 
e.g. cost-effectiveness, usefulness demonstrated in other regions, 
taxonomically well known 

Select appropriate species, higher level taxon, assemblage, or 
community (decision will be scale dependent) 

4 

Accumulate data on the relationship between the putative bioindicator 
and the environmental (relational) variable/stressor of interest 

For example, measure the response of the indicator to the disturbance 
or environmental variable of interest by measuring both the response 
and the environmental variable(s) 

5 

Establish statistically the relationship between the putative 
bioindicator and the environmental (relational) variable 

Determine both the significance and the strength of the relationship 
(Fig. 18.2) 

6 

Based on the nature of the relationship, either accept (preliminarily) or 
reject the species, taxon, or assemblage as a bioindicator 

Is the relationship significant and strong (e.g. high coefficient of 
determination)? 

If yes, then move on to step 7, if not, repeat from step 3 

7 

Establish the robustness of the bioindicator by developing and testing 
relevant hypotheses 

For example, test the relationship established in step 6 in following 
years, different seasons, or in the same context but at different sites 
or regions (Fig. 18.2) 

8 

If the null hypotheses are rejected, make specific recommendations 
based on the original objectives for the application of the 
bioindicator 

Develop the bioindicator as a tool for use in monitoring programmes for 
management, for example by the development of user guides, 
pictorial keys, and decision-support systems 

9 

Establish the generality of the bioindicator (optional) 

Test the same set of hypotheses developed in step 7 in different 
contexts, for example different agroecosytems, different bioregions 
(temperate versus tropics), or different disturbance or management 
regimes (Fig. 18.2) 


of particular sensitive groups, such as salamanders 
(Rovito et al. 2009), and alterations in the ranges 
and assemblage composition of plants and insects 
(Kelly & Goulden 2008; le Roux & McGeoch 2008; 
Raxworthy et al. 2008; Chen et al. 2009). Often such 
studies draw attention to deficiencies in the earlier 
surveys that constrain current understanding, but 
which of course, importantly, serve to inform the 
way in which new surveys should be designed. 
Where a set of sites is to be re-surveyed given histor¬ 
ical data, much merit lies in assessing carefully how 
best to replicate the past work to ensure as much 
comparability as possible, whilst also providing an 
approach that may prove to be more valuable to 
those in the future who might wish to survey the 
site again. 


These examples show how thoughtful consid¬ 
eration of previous work (where the full data set 
was often published; see Lawton (1999a)) can pro¬ 
vide insights that might otherwise not have been 
possible. Indeed, substantial scope exists for the 
combination of innovative approaches and extant 
data to provide further understanding of biodi¬ 
versity changes in modern landscapes and of eco¬ 
logical questions more generally. Although such 
approaches might not be considered 'monitoring' in 
the conventional sense, they certainly provide the 
kinds of information that monitoring approaches 
seek to deliver. A few additional examples readily 
illustrate the point. First, the latitudinal gradient 
in richness is one of the most well known biolog¬ 
ical patterns. Typically it is seen as a pattern that 
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Value in monitoring and landscape management 


Relationship Robustness Representivity Generality 


1. Statistical 
significance. 

This is a necessary 
condition. 

2. Strength. 

The greater the 
strength 

of the relationship, 
the more valuable 
the bioindicator 
in monitoring and 
management. 


Spatial and temporal 
variability in measures 
of the relationship 
within the context of 
the bioindication 
objective. 

The lower the 
variability, the more 
robust the indicator. 


The degree to which 
the bioindicator 
represents the 
response of other taxa 
to the same object of 
indication. 

The object of indicator 
here may be a stressor 
(e.g. fragmentation, 
pesticide application or 
other form of 
disturbance) or the 
diversity or abundance 
of other taxa. 


The degree to which 
the first three levels of 
predicatbility hold 
when extrapolated to 
scenarios other than 
that for which the 
bioindication system 
was developed. 

For example, the 
bioindication system 
may be applied in 
different 
agroecosystem, 
protected area or 
geographic region. 



Figure 18.2 The predictability gradient in bioindication. As a biological relationship moves along the predictability gradient (left to right), so its value as 
a bioindicator increases and hence its value for assessing the status of, and monitoring changes in, biodiversity in managed and transformed landscapes 
(adapted from McGeoch (2007)). Note that representivity is not relevant in all bioindication contexts. 


demonstrates relatively long-term changes (Crame 
2001), but which shows the bulk of its variation 
among taxa, systems, and regions (Gaston 1996b; 
Hillebrand 2004). Recent work using long-term fish¬ 
eries data collected in the northwest Atlantic has 
shown not only that the gradient exhibits inter¬ 
annual variation, but also that this variation is 
strongly related to changes in the sign and strength 
of the North Atlantic Oscillation (Fisher et al, 
2008). Second, the extent to which the phenotypic 
response to climate change (such as advances in 
arrival dates or changes in phenology; see Parme¬ 
san (2006)) have a microevolutionary basis or are 
the result mostly of environmental responses is a 
significant question for understanding the likely 
responses of organisms to further change (Davis 
& Shaw 2001; Chown & Terblanche 2007; Gien- 
app et al. 2008). To disentangle these processes, 
information on the relatedness of individuals is 
usually required. Recently, Gienapp et al. (2007) 
proposed an approach using phenotypic mea¬ 
surements through time, converted to haldanes, 
which provides at least an estimate of whether 
the rates of change are within those expected 
from theory and empirical studies of evolution¬ 
ary change (e.g. Lynch & Lande 1993; Hendry 
& Kinnison 1999). If rates of phenotypic change 


lie above a few per cent of 1 haldane, then 
the change is probably not only a consequence 
of microevolutionary change in the trait mean. 
Finally, owing to the substantial and rapid effects 
humans may have on phenotypic change in ani¬ 
mals (Hendry et al. 2008), and on the apparently 
rapid response of size to changes in the envi¬ 
ronment (e.g. Gilchrist et al. (2004), although not 
always associated with genetic change; see Teplit- 
sky et al. (2008)), utilization of size data, which can 
often be obtained readily from museum specimens, 
may provide considerable insight into environmen¬ 
tal change in the absence of specific monitoring 
programmes. 

18.2.3 Measuring biodiversity for 
management 

Having established that the goals of biodiversity 
measurement in modified and managed landscapes 
vary with the extent of management and modifi¬ 
cation, and clearly with the scale of the study or 
question being addressed, we provide here sev¬ 
eral sets of examples that have used a variety of 
methods to address these goals. Our aim is not to 
suggest that these are the definitive measurement 
approaches that should be adopted, but rather to 
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illustrate the variety of methods that can usefully 
be applied to biodiversity measurement in modified 
landscapes. 

Human influences on avian species richness 
That humans have an influence on diversity is 
widely accepted. However, the extent to which 
conservation areas might mitigate this influence 
is often not explicitly demonstrated, despite the 
fact that doing so is of considerable significance 
(Gaston et al. 2008b). This is especially the case 
with species for which protected areas were not 
primarily intended. Moreover, the converse also 
applies: studies of broad-scale patterns in biodiver¬ 
sity should consider human influences more gen¬ 
erally (see section 18.2.1). Using bird atlas species 
richness data, at a quarter degree square resolution 
for South Africa, energy availability estimated in 
the form of the normalized difference vegetation 
index, proportion of a grid square covered by a for¬ 
mally protected area, and a generalized linear mod¬ 
elling approach, Evans et al. (2006a) demonstrated 
that conservation areas are effective. In low-energy 
areas, species richness typically increases from 64 to 
85 species as protection increases from 0 to 100%. 
By contrast, in higher energy areas, the increase 
is larger—from 213 to 250 species. This broader 
scale work prompted the local-scale investigation 
of three protected areas (established for purposes 
other than avian conservation) and the landscapes 
surrounding them, demonstrating that these pro¬ 
tected areas were indeed effective in conserving 
both avian richness and particular foraging guilds 
(Greve 2007). 

Enhancing biodiversity in agricultural and forestry 
landscapes 

Measurement of biodiversity in landscapes man¬ 
aged for forestry or agriculture is generally aimed 
at either (i) enhancing ecosystem services pro¬ 
vided by biodiversity that are of value in such 
settings (e.g. biological control via natural ene¬ 
mies, pollination and provision of host plants and 
habitats for such species) (Bianchi et al. 2006) or 
(ii) improving the general biodiversity conserva¬ 
tion value of regions that are largely transformed 
and have a long history thereof (e.g. Kleijn et al. 
2006; McGeoch et al. 2007). Work on the general 


goal of enhancing biodiversity in European agro¬ 
ecosystems has revealed that much more targeted 
objectives are required to achieve specific biodi¬ 
versity outcomes (Box 18.1). Different management 
measures were shown to be required for the alterna¬ 
tive goals of benefiting common species (often those 
performing important roles as predators and pol¬ 
linators in fields) versus improving the conserva¬ 
tion value of agro-ecosystems by providing suitable 
habitat for rare species (Kleijn et al. 2006). 

The work of Tscharntke et al. (2005) also 
clearly illustrates the importance of the relation¬ 
ship between landscape transformation and man¬ 
agement shown in Fig. 18.2. In highly transformed 
agricultural settings, management interventions to 
enhance biodiversity are not very effective (because 
there is little available source habitat), whereas 
less transformed landscapes are more respon¬ 
sive to biodiversity-oriented management practices 
(Box 18.1). Furthermore, the species richness of 
human-dominated landscapes provides insight into 
the functional characteristics of such landscapes. 
For example, Tscharntke et al. (2008) showed 
that tropical agricultural landscapes not only sup¬ 
ported fewer species of birds and insects than 
semi-natural habitats, but that functional diversity 
also differed between them. Thus, more frugivores 
and nectivores but fewer insectivores occurred in 
the agroforest avian assemblage compared with 
unmanaged forests. Furthermore, for both birds 
and insects, assemblages were more similar across 
agricultural landscapes (lower beta diversity) than 
they were across natural landscapes. The study also 
demonstrated that the scale of landscape structures 
is an important determinant of biodiversity across 
managed landscapes, as the distance between 
managed and unmanaged patches of habitat has 
a significant effect on richness and functional 
diversity. 

Further significant insights have been gained by 
adopting spatially scaled approaches to measuring 
biodiversity across mixed-use landscapes (Steffan- 
Dewenter et al. 2002; Box 18.1). For example, spi¬ 
der diversity in cereal crops responded to both 
local (higher diversity at field edges than centres) 
and landscape (more diverse in complex than in 
simple) scale structures (Clough et al. 2007). This 
scale dependence of diversity patterns and drivers 
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Box 18.1 Measuring biodiversity in agricultural landscapes: objectives and insights 


Significant insight has been gained from the measurement 
of biodiversity in agricultural landscapes on the patterns 
and drivers of biodiversity in managed landscapes, as well 
as their scale dependence. The interdependence of the dual 
goals of maximizing biodiversity and function in managed 
landscapes is also well illustrated from work done in 
this area. 



Figure Box 18.01a The number of flower-visiting bee species 
increases as the proportion of semi-natural habitat in the 
pre-dominantly agricultural landscape increases (redrawn from 
Tscharntke et al. (2005), but see Steffan-Dewenter et al. (2002)). 



Figure Box 18.01 b The number of pumpkin seeds per fruit 
increases as the number of pollinating bee guilds increases (redrawn 
from Hoehn et al. (2008)), a clear illustration of the functional 
importance of biodiversity. 


is one of the fundamental considerations during the 
design and analysis of biodiversity measurement 
studies (Box 18.1; see also Chapters 11 and 12). 


The figures below, for example, illustrate the positive 
relationships between biodiversity and ecosystem 
functioning in an agricultural context, how management 
effectiveness is determined by landscape structure, and a 
successful sample design for understanding the scale 
dependence of these relationships. 



Figure Box 18.01c The hump-shaped relationship between the 
degree of landscape transformation (from cleared to complex) and the 
effectiveness of managed schemes aimed at enhancing biodiversity 
(redrawn from Tscharntke et al. (2005); see text for explanation). 



Figure Box 18.01d A schematic of the hierarchically scaled 
sampling design (extending radii of 250 m) used by Steffan-Dewenter 
et al. (2002) and others to demonstrate the importance to biodiversity 
not only of the amount of habitat, but also the spatial characteristics of 
such habitat, including the distribution, size, and total area of land 
under different levels of management intensity (from natural, 
unmanaged to intensively managed). 

The bioindication value of taxa for reflecting the 
functional status of such settings has also been 
examined, for example using trap-nesting bees and 






MEASURING BIODIVERSITY IN MANAGED LANDSCAPES 261 


wasps. A range of independent studies has demon¬ 
strated that aculeate bees (specifically their rich¬ 
ness and abundance), and to a lesser extent wasps, 
respond significantly and consistently to habitat 
isolation and plant diversity (the latter indica¬ 
tive of food and nest site availability) (Tscharntke 
et al. 1998). This group of Hymenoptera has addi¬ 
tional value as a bioindicator because its members 
are functionally significant in agro-ecosystems (as 
pollinators and in biological control). Monitoring 
groups such as this is therefore likely to be essential 
for evaluating the biodiversity value and functional 
status of many agricultural and mixed-use land¬ 
scapes (Schulze et al. 2004). 

Several of these relationships between biodiver¬ 
sity, landscape structure, and management have 
been measured and tested not only across sites 
within regions, but also across European and tropi¬ 
cal systems, and have been found to be similar (i.e. 
the ideas have gained some degree of generality; 
Fig. 18.2). In consequence, the importance of hetero¬ 
geneity for maintaining biodiversity in landscapes 
managed for agriculture and forestry is now well 
supported. 

A macroecological approach to measuring biodiversity 
in managed landscapes 

Recently, an alternative, macroecological, approach 
to understanding the impact of management on 
biodiversity has been proposed. It is based on 
the rationale that given the contingencies asso¬ 
ciated with each management system and man¬ 
aged landscape, an approach that is able to be 
applied to the measurement of any system, in com¬ 
parable units, may better reveal general assem¬ 
blage responses to management intensity than 
those that focus on assemblage diversity and com¬ 
position. Domelas et al. (2009) thus proposed 
that the shape of species abundance distributions 
(SADs, the frequency distribution of species abun¬ 
dances in an assemblage expressed as a histogram, 
see Chapters 9 and 17) may be used to infer 
the relative degree of environmental heterogene¬ 
ity. Although, for sufficiently large sample sizes, 
SADs are approximately log-normal, the authors 
propose that the deviation from log-normal is sys¬ 
tematically related to the degree of environmental 
heterogeneity. Specifically they hypothesized that 


heterogeneity increases multimodality in SADs and 
that this effect of the shape of SADs is produced 
regardless of the mechanism generating that hetero¬ 
geneity. Using weed, seed-bank communities sub¬ 
jected to a range of agronomic treatments (e.g. fertil¬ 
ization, ploughing and different cover crops), Dor- 
nelas et al. (2009) demonstrated that environmental 
heterogeneity does indeed, as predicted, generally 
produce more multimodal SADs. This hypothesis 
is ripe for further testing across a range of man¬ 
agement intensity gradients, and if supported will 
add substantially to our current understanding of 
biodiversity in managed landscapes. 

18.2.4 Matrices for measurement 

The quote by Andrewarth and Birch (1954) at the 
start of the chapter nicely reflects early recogni¬ 
tion that to understand pattern in and mecha¬ 
nisms underlying abundance variation, measures 
of the environment, the physiological and behav¬ 
ioural responses of individuals, and of abundance 
are required. The same could be said of many, if not 
most, levels in the biological hierarchy. Although 
the fields typically occupied with such measure¬ 
ments developed separately over much of the 20th 
century (Gaston et al. 2009), current work shows 
that they are being re-integrated, especially with 
substantial modern efforts to understand the effects 
of human-mediated climate and landscape change 
(Chown & Terblanche 2007; Soberon 2007; Chown 
& Gaston 2008; Kearney & Porter 2009; Gaston et al. 
2009). 

Perhaps the major differences between early inte¬ 
grated work and current approaches are those that 
have resulted from an increase in computing power 
which, in turn, has enabled integration to take 
place in a spatially explicit manner. The modern 
emphasis on spatial ecology not only recognizes the 
importance of spatial scale and structure, but also 
the significance of dispersal processes in affecting 
everything from the extent of phenotypic plasticity 
to the position of range edges and the structure of 
assemblages (Hubbell 2001; Holt et al. 2004b; Veldt- 
man & McGeoch 2004; Chown & Terblanche 2007; 
Angilletta 2009; Gaston 2009; Chapter 11). Indeed, 
a spatially explicit approach may prove to be one 
of the most useful ways of integrating biodiversity 
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measurements across a range of hierarchical levels 
(Hui et al. 2006). Such an approach is proving neces¬ 
sary not only to understand patterns of biodiversity 
variation and the mechanisms underlying it, but 
also to understand how both may change through 
time owing to landscape change, landscape man¬ 
agement, responses to unintended or intentional 
human influences, or to evolution in response to 
these changes. 

In the context of biodiversity surveys. Bell (2003) 
recognized the value of between-site differences in 
providing information on spatial variation in traits 
in the biodiversity matrix. A substantial amount 
of information may be obtained from consider¬ 
ing row and column totals and variances from a 
species by sites (r x c) matrix, including richness 
variation, variation in abundances of species, and 
the extent to which assemblages may be nested 
(see Chapter 11 and Simberloff & Connor (1979) 
for an earlier and related matrix approach, i.e. Q- 
mode and R-mode analysis). Although Bell's (2003) 
assessment was concerned with either presence- 
absence or abundance data, the cells of the matrix 
may also take the form of other characters, such 
as body size. Indeed, when this is done it imme¬ 
diately becomes apparent that the r x c approach 
provides a means of distinguishing between spa¬ 
tial variation in traits at the intraspecific, interspe¬ 
cific, and assemblage levels, and understanding the 
relationships among them (Gaston 2008a; Fig. 18.3). 
For example, whilst intraspecific clinal variation in 
size probably has to do with temperature, water 
availability, season length, organism size, and gen¬ 
eration time (Chown & Gaston 2010), interspecific 
variation is also influenced by range position and 
taxonomic replacement, whilst assemblage varia¬ 
tion includes these influences as well as variation in 
species richness (Greve et al. 2008; Olson et al. 2009). 
Dray and Legendre (2008) take the matrix approach 
a step further by demonstrating how the linkage 
(using an extension of the fourth-corner method) of 
a species abundance matrix with an environmental 
and phenotypic matrix (behavioural or other trait 
matrix of interest) can be used to describe the multi¬ 
variate relationships between species, species traits, 
and the environment. 

The r x c approach can also be applied to phys¬ 
iological variables, such as desiccation resistance. 


metabolic rate, or thermal tolerance (Gaston et al. 
2009). Likewise, the complexities of responses in the 
presence of other species may be incorporated into 
the r x c approach (see de Mazancourt et al. (2008) 
and Angilletta (2009) for examples). Once this is 
done, and it is recognized that the site columns 
can readily be decomposed into two or three spa¬ 
tial dimensions ( x, y, z coordinates) (i.e. be made 
spatially explicit; see Hui et al. (2010) and Chap¬ 
ter 11), it rapidly becomes dear that the matrix 
approach can be used to represent landscape vari¬ 
ation (e.g. variation in mean daily maximum tem¬ 
perature or soil type), a function that relates this 
variation to survival, reproductive rate, or abun¬ 
dance (Chown & Gaston 2008; Angilletta 2009), and 
finally to represent abundance variation within and 
across taxa (Fig. 18.3). Moreover, the function relat¬ 
ing the environment to abundance may also take 
some other form, such as the approach set out as the 
fourth-comer problem (Legendre et al. 1997; Dray 
& Legendre 2008) or as a spatially explicit popula¬ 
tion viability model. In turn, the matrices may be 
stepped through time, as spatially explicit popu¬ 
lation models often do, with the landscape matrix 
changing, the functional matrix evolving, or both, 
to provide predictions of what the outcome of these 
changes might be. At large scales this approach is 
very similar to mechanistic niche modelling devel¬ 
oped by Kearney and Porter (2009); at smaller scales 
it is equivalent to spatially explicit population via¬ 
bility assessment, whilst at the field scale it can be 
seen as similar to pest forecasting models. Because 
the spatial autocorrelation structure of the environ¬ 
ment, and how individuals respond to it, may also 
lie at the heart of larger-scale patterns, such as the 
species-area relationship, species abundance distri¬ 
butions, and the occupancy-area relationship (e.g. 
Hui & McGeoch et al. 2007a,c; Storch et al. 2008), 
this approach also provides a means to understand 
how environmental variation may be responsible 
for generating these macroecological generalities. In 
this regard, perhaps the most well-acknowledged 
risk of inferring process from spatial pattern is that 
both different and multiple processes may generate 
similar patterns. Mclntire and Fajardo (2009) out¬ 
line an approach for linking spatial pattern and 
process, and maximizing the inference that can be 
achieved using space as a surrogate for unmeasured 
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Figure 18.3 (a) An r x c matrix of species by sites indicating how physiological variables may be included in such a matrix and can provide insight into 
intraspecific, interspecific, and assemblage-level variation. The variable here is critical thermal minimum (see Chown & Terblanche (2007)). The italics 
indicate intraspecific variation across space. Interspecific variation is calculated as a mean value for the species at the centre of its range, whilst 
assemblage characteristics are the mean and variance of a trait across all species at a given site (see Gaston et al. (2008a)). (b) A schematic indication of 
how the former matrix may be made spatially explicit to investigate the relationships between environmental variation (shown in the lower matrix), 
performance (an upper lethal temperature figure is shown), and abundance variation. 


environmental variables. Likewise, a null model 
approach based on r x c matrices, which may be 
used as an alternative or complementary one to 
that described above, has been elaborated by Gotelli 
et al. (2009). 

Managed landscapes represent excellent oppor¬ 
tunities for understanding biodiversity pattern and 
process and its maintenance, as well as for testing 
new methods of biodiversity measurement because 
possibilities for replication and semi-controlled 
experimentation are often more practicable than 
in more natural settings. How useful these new 
developments in matrix approaches might prove 
to be in the context of understanding biodiversity 


in managed landscapes is yet to be determined. 
However, given their considerable utility in other 
settings, we are of the view that they will provide 
considerable insight. 

18.3 Prospectus 

By 2050 the human population is expected to reach 
9 billion, and along with this increase our influence 
on the planet will also rise (Millennium Ecosys¬ 
tem Assessment 2005). In consequence, biodiver¬ 
sity measurement seems set to be undertaken in 
an increasingly human-dominated world, a world 
that has fewer species, less available energy for 
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those that remain, and sharper edges, as more areas 
are managed for agriculture and for fuel produc¬ 
tion (Sutherland et al. 2008). Some areas will also 
become home to entirely novel ecosystems—those 
which are highly modified but not highly managed 
(and to which we have admittedly paid little atten¬ 
tion here; see Hobbs et al. (2006)). Future biolo¬ 
gists undertaking biodiversity measurement will no 
doubt have to cope with these challenges. Perhaps 
the best lesson that current measures of biodiver¬ 
sity in modified landscapes has taught us is that 
much more attention should be given to mitigation 
of our impacts, rather than to attempts to adapt to 
them. 

18.4 Key points 

1. Humans currently dominate much of the globe, 
both on land and at sea. In consequence, the 
measurement of biodiversity often concerns bio¬ 
diversity in managed landscapes. 

2. Objectives for measuring biodiversity typically 
differ depending on land-use type and its asso¬ 
ciated degree of landscape transformation and 
management intensity. 

3. In managed landscapes, the biodiversity to be 
measured may range from the level of genes 
and transcripts, through populations, to assem¬ 
blages and metacommunities. No particularly 
special set of methods is required for managed 
landscapes, beyond those typically advised for 
a particular level of the biological hierarchy. 


However, the form, structure, grain, and varying 
age of the elements of the landscape should be 
considered carefully, as should the scale or scales 
at which the study should be undertaken and the 
sampling methods required to address the issues 
at hand. 

4. Bioindicators should be developed with a focus 
on clear objectives, ease of sampling, cost- 
efficacy, and taxonomic knowledge. The indica¬ 
tor and variable of interest should have a sig¬ 
nificant, strong, and robust relationship, and the 
value of the former will be enhanced if it is rep¬ 
resentative and general. 

5. An r x c approach to biodiversity measurement, 
across the full range of managed and relatively 
unmanaged landscapes, provides a means to 
explore and integrate both patterns in and the 
mechanisms underlying variation in biodiver¬ 
sity across the full biological hierarchy. Sev¬ 
eral complementary approaches now exist that 
are based on such matrices, which at the finest 
resolution can be thought of as continuous 
data. 
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CHAPTER 19 


Estimating extinction with the fossil 
record 

Peter J. Wagner and S. Kathleen Lyons 


19.1 Introduction 

Many ecological and palaeontological studies focus 
on extinction. The fossil record is particularly 
important for studying long-term patterns in 
extinction: although analyses of extant phytoge¬ 
nies can estimate extinction rates (e.g. Alfaro et al. 
2009) and even suggest mass extinctions (e.g. Crisp 
& Cook 2009), they cannot imply trilobites ever 
existed or that sphenodonts (now represented only 
by the tuatara) were once as diverse as lepi- 
dosaurs (lizards and snakes). However, workers 
also use the fossil record to test ideas about the 
pace of major extinction events that use meth¬ 
ods similar to those that conservation biology 
might use. Here we will review current palaeobi- 
ological methods for inferring extinction patterns, 
spanning 'traditional' methods using stratigraphic 
ranges to methods using more exact information 
about distributions of finds within stratigraphic 
ranges. 

Palaeontological studies of diversity patterns also 
focus extensively on standing richness and origi¬ 
nation rates. We do not focus on either of these 
parameters for their own sake. However, stand¬ 
ing richness (usually referred to simply as 'diver¬ 
sity' in palaeontological studies) is an important 
parameter when discussing extinction as extinc¬ 
tion metrics necessarily rely on changes in stand¬ 
ing richness. Origination rates also can be an 
important parameter as the distribution of origi¬ 
nations within time spans (i.e. evenly throughout 
the interval vs concentrated in the beginning) have 
small but important effects on the predictions of 
extinction rate hypotheses. Moreover, palaeonto¬ 
logical techniques for estimating origination rates 


are essentially identical to those for estimating 
extinction, save that they are done 'in reverse'. 
Thus, discussions of how one estimates extinction 
rates using fossil data can almost double for dis¬ 
cussion of how one estimates origination from fossil 
data. 

The other parameter that will be important 
throughout this review is preservation (i.e. sam¬ 
pling) rate. Just as ecologists know that incom¬ 
plete sampling affects sampled richness (e.g. Hurl- 
bert 1971) and implied extinction (e.g. Solow 
1993a), palaeontologists know that incomplete sam¬ 
pling affects the first and last appearances of fos¬ 
sils, both in local sedimentary sections (Signor & 
Lipps 1982) and globally (Sepkoski 1975). Work¬ 
ers have expressed concern that major extinctions 
are exaggerated by or possibly even illusions of 
intervals of poor preservation (Raup 1979; Smith 
et al. 2001; Peters & Foote 2002). Because sam¬ 
pling intensity is of interest to both fields, we shall 
also discuss how palaeontological studies address 
sampling when estimating diversity and diversity 
dynamics. 

19.2 State of the field 

19.2.1 Basic metrics 

Palaeontological studies of richness and diversity 
dynamics (i.e. extinction and origination) date back 
to the 19th century (Phillips 1860). Traditional 
palaeontological studies use synoptic databases (e.g. 
Sepkoski 1982, 2002), which catalogue the first 
and last appearances of taxa. These almost always 
are binned into chronostratigraphic units: typically 
stages and substages, but sometimes as fine as 
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I II V 



Figure 19.1 Chronostratigraphic ranges of five hypothetical taxa. Each 
f gives a separate chronostratigraphic unit, with f t the oldest. Dashed 
vertical line for V indicates that taxon V is not actually found in unit f 2 . 
Modified from Foote (2000), Fig. 1. 

faunal zones. (We use 'intervals' to refer chronos¬ 
tratigraphic units because of the correspondence 
between units of time and strata; see Grad- 
stein et al. (2005)). For numerous reasons, synop¬ 
tic studies typically use supraspecific taxa, with 
genera being the most common taxon since the 
mid 1980s. 

Fig. 19.1 presents a simple example. We use 
last appearances in each unit t to infer rates of 
extinction. Two types of metrics commonly are 
used: per taxon (pt) and per capita (pc; see Foote 
(2000)). Per-taxon metrics measure the propor¬ 
tion of standing richness, S, that have their last 
appearance in an interval. Per-taxon extinction 
thus is: 



where Sp is the number of taxa last appearing in an 
interval. In Fig. 19.1, taxa III and IV last appear in 
h, so S l = 2. S is not the sum of taxa sampled in 
an interval, but the sum of taxa with chronostrati¬ 
graphic ranges spanning the interval. Taxon V in 
t 2 illustrates the difference: because it first appears 
before t 2 and last appears after t 2 , we assume that 
it existed in t 2 even though V is not sampled in 
t 2 . This assumption is unsound only if taxon V is 
polyphyletic. As the synoptic data for taxa I and V 
are identical, we need a more detailed database to 
recognize cases like taxon V. Thus, S t2 = 5 regardless 
of how many 'range-through' taxa (e.g. I and V) 
actually are sampled in t 2 , and ju pt = 0.40. (Per-taxon 
origination, A pt , simply replaces Sl with Sp, the taxa 
first appearing in t 2 ). 


Per-capita extinction rate measures the survivor¬ 
ship of taxa present at the outset of an interval. 
This is: 

^ = h(A) (19.2) 

where S a is the number of taxa crossing the bound¬ 
ary from the prior interval and S a2 is the number 
of the taxa also crossing through both the base 
and the top (i.e. the range-through taxa). Note that 
'per capita' here refers not to individuals within a 
species, but to taxa as operational units. For t 2 in 
Fig. 19.1, S a is 3 (taxa I, III, and V) and S az is 2 
(taxa I and V), giving = 0.41. The most likely 
probability of a taxon becoming extinct in an inter¬ 
val is now 1 — e^. However, this is identical to 
p pc if we restrict per-taxon rates to only boundary 
crossers. 

Chronostratigraphic intervals vary considerably 
in temporal length (Gradstein et al. 2005). Accord¬ 
ingly, workers often divide rates by interval length 
to get rates per million years instead of rates per 
interval. This is appropriate if extinction is dis¬ 
tributed throughout an interval. One expectation 
of continuous turnover rates is that longer inter¬ 
vals should have higher extinction rates. However, 
Foote (1994) showed that 'raw' extinction metrics 
are random with respect to the length of chronos¬ 
tratigraphic units, whereas per million year rates 
show a negative correlation. We will return to this 
issue below when we summarize tests of pulsed vs 
continuous rates. 

The distribution of extinctions and originations 
plays a role in whether we should prefer per-taxon 
or per-capita extinction rates. If extinction and orig¬ 
ination happen throughout an interval, then per- 
taxon rates overestimate immediate extinction risk 
as well as exaggerating S from any one slice of 
time. 'Singletons,' that is those taxa known from 
only one interval, such as taxon IV in Fig. 19.1, 
exacerbate this problem. Singletons often reflect 
differences in research effort and/or available fos¬ 
sils (Raup & Boyajian 1988) and excluding single- 
tons greatly reduces the volatility of per-taxon rates 
(Alroy 1996). Although per-capita rates are excel¬ 
lent for describing continuous rates, they still can 
accurately reflected pulsed extinction. 
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19.2.2 Survivorship curves 

Per-capita rates lead conceptually to survivorship 
curves. The per-capita rate for any one interval 
reflects the proportion of taxa expected to survive 
the entire interval into the next interval. Survivor¬ 
ship curves reflect the proportion of taxa that sur¬ 
vive multiple successive intervals. Some survivor¬ 
ship analyses contrast rates among higher taxa in 
order to assess whether extinction rates in some 
groups differ markedly from those in other groups 
(e.g. Simpson 1944,1953) or whether there are com¬ 
mon patterns among higher taxa (Van Valen 1973, 
1979; Raup 1991). Alternatively, cohort analyses 
contrast sets of taxa that originate in the same inter¬ 
val (i.e. Raup 1978; Foote 1988) to examine whether 
extinction rates change markedly over time. Given 
a per-taxon extinction rate, we expect the pro¬ 
portion of taxa that survive N intervals, /(N), 
to be: 

f(N) = ^ pc = e~ N ^ (19.3) 

Given ,u pt = — ln(|) from our example above, we 
expect 66.7% of the taxa to survive 1+ intervals, 
44.4% to survive 2+ intervals, etc., and thus 33.3% 
of the taxa to have durations of 1 interval, 22.2% 
to have durations of 2 intervals, etc. Of course, in 
reality this will vary even if extinction rates are con¬ 
stant simply by chance. However, the slope of the 
logged frequency of taxa with durations of 1, 2, 3, 
etc. intervals plotted against duration approximates 
the average pi pc , 

Foote (1988) uses Monte Carlo simulations to 
demonstrate that survivorship slopes for cohorts of 
Cambrian trilobites are significantly steeper than 
those for cohorts of Ordovician trilobites. (Subse¬ 
quent revisions to the timescale greatly reduced the 
average longevity of Cambrian trilobites, making 
the pattern more pronounced than Foote reports; 
Bowring et al. 1993). Raup (1991) also uses this 
approach to demonstrate that survivorship slopes 
for Phanerozoic cohorts fit the expectations of a 
general 'kill curve' of extinction rates. 

19.2.3 The importance of sampling 

For three reasons, equation 19.3 will not predict the 
distributions of stratigraphic ranges even if n pc is 


constant over time and among taxa. First, if we 
measure stratigraphic ranges in discrete bins (as is 
almost always done), then f(N = 1) = e~^ c only if 
all taxa originate at the very base of intervals. If 
origination is continuous throughout intervals, then 
we expect (for example) half of the taxa lasting one 
half an interval to span from t x to t, + i. Thus, only 
the slope from taxa with ranges of 2+ intervals will 
reflect fx pc under continuous diversification. Second, 
extinction rates do not make predictions about the 
ages of extant taxa: instead, the age distribution of 
contemporaneous taxa (i.e. a 'backwards survivor¬ 
ship' curve sensu Pease (1988) or a 'prenascence' 
curve sensu Foote (2001b)) reflects origination rates 
(Foote 2001b). 

The third reason is the most critical: extinc¬ 
tion rates make predictions about temporal dura¬ 
tions, but we can observe only stratigraphic ranges. 
As preservation rates strongly affect stratigraphic 
ranges, 'survivorship' curves using fossil ranges 
actually reflect both preservation and extinction 
(Sepkoski 1975; Pease 1988; Foote & Raup 1996). 
Consider a simple example of two clades where 
origination and extinction occur at the beginning 
and end of each interval, and for which fj. pc = 0.69 
(Fig. 19.2). For one clade, the probability of recov- 

Q Duration (p pc = 0.69) 

□ Range (R = 0.50) 



Intervals 


Figure 19.2 The relationship between durations (time from birth to 
death) and stratigraphic ranges (= number of chronostratigraphic units 
from first to last appearances). This simulation of 5000 taxa uses 
ix pc = 0.69, which predicts half of the taxa present at the outset of an 
interval to go extinct in that interval. Sampled stratigraphic ranges reflect 
sampling intensity (/?) of 0.5 and 0.05 per interval. Following 
palaeontological studies, both durations and ranges are binned into 
discrete intervals. 
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ery, R, is 0.5 per interval; for the other, R = 0.05 
per interval. These illustrate two patterns empha¬ 
sized by Foote and Raup (1996). First, the slopes 
of the logged survivorship curves after the first 
interval parallel each other quite well. Second, 
the proportion of taxa sampled from 1 interval 
to those sampled over 2+ intervals increases as R 
decreases. At sufficiently low R, all taxa are known 
from single finds (and thus single intervals) only. 
From this, Foote and Raup derived preservation 
intensity as: 


Rfr - 


5sr=l X S sr -3 


(19.4) 


where S sr=x is the number of taxa with stratigraphic 
ranges of x intervals. In our example here, R fr = 
0.53 when true R = 0.50 and R F r = 0.09 when true 
R = 0.05. 

The ability to contrast preservation rates over 
time or among taxa obviously interests palaeontolo¬ 
gists. However, subsequent work embraced the fact 
that joint hypotheses of preservation and extinction 
make exact predictions about distributions of strati¬ 
graphic ranges. This led to a shift from contrasting 
slopes of survivorship curves to contrasting the 
likelihoods of hypothesized extinction and preser¬ 
vation rates (Solow & Smith 1997; Foote 1997). 
Consider a simple system where origination and 
extinction occur at the beginning and end of inter¬ 
vals. Extinction here is a per-taxon metric giving 
the proportion of taxa expected to become extinct 
at the end of each interval. The expected proportion 
of taxa with a stratigraphic range ( sr ) of 1 interval 
is: 


W 

E [PM( 1 - PMY d ~ l) x d(l - R) (d ~ sr) R] 

flsr = 1] = - 

1 - E [PW(1 - P[/J) w - 1) x (i - ry] 

d =1 

(19.5a) 


while the expected proportion of taxa known from 
x = 2, ..., w intervals (where w is the maximum 
possible) is: 


/[«■]=£ 


JPM( 1 -PM)“'- 1) x ( 1 + d - sr)(l - Rf~ 5r) 

(sr-2-n)£(n+2) 




l-£[ p M(l - PW) w_1, x(l - R) d ] (19.5b) 

d =1 


where d is the true duration and n is the number 
of intervals between the first and last interval in 
which a taxon might be found (Foote 1997). The 
three important recurring terms here are: 

ju(l — nY~ l : the probability of avoiding extinction 
for d — 1 intervals and becoming extinct in 
interval d, where ft is equivalent to ft pc for 
boundary crossers only; 

(1 — R) x : the probability of being unsampled for x 
intervals; 

R' J : the probability of being sampled in y intervals. 


The denominator in both equations estimates the 
proportion of taxa that are sampled as one minus 
the proportion that are unsampled, i.e. the prob¬ 
ability of having duration d times the probability 
of going unsampled for d intervals summed over 
the entire range of possible durations ( w ). This is 
because we can test hypotheses based only on their 
predictions about sampled taxa. The numerator also 
sums over the range of possible durations. The 
second part of both numerators reflects the num¬ 
ber of ways a stratigraphic range of sr fits within 
a duration of d, which is simply 1 + (d — sr). For 
example, there is only one way to fit sr = 4 within 
d = 4(h ,... ,U) and two ways to fit sr = 4 within 
d = 5(f-| f 4 and ti tj), etc. The final term 

in equation 19.5a, E ( ^ ^ ) (1 — R) (sr_ 2 _ " ) R ( " +2) , 

> 1=0 \ n I 


sums the probabilities given R of all of the different 
stratigraphic records that match a synoptic range 
of sr. All taxa with sr = 4 are found in at least two 
intervals (the first and last). However, there are four 
possible stratigraphic records consistent with that 
synoptic range (Fig. 19.3), therefore the likelihood 
function must sum the probabilities of all four pos¬ 
sibilities given any R. 
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Figure 19.3 Four exact stratigraphic distributions resulting in a 
synoptic range of four intervals. 


Foote (1997) presents more complex variations of 
equation 19.5 for continuous stratigraphic ranges. 
As most palaeontological studies use discrete bins, 
we will not cover this in detail. However, we will 
return to the effects of continuous y, and A (origi¬ 
nation) on the expectations for stratigraphic ranges 
below. At this point, equation 19.5 gives us an 
expected observed survivorship curve given any 
combination of fx and R. Multinomial probability 
then gives the probability of the data given that 
combination. However, this obviously is not wholly 
satisfactory for several reasons. In particular, this 
approach is useful for contrasting general differ¬ 
ences in average extinction rates between taxa while 
controlling for the possibility of different average 
preservation rates. Most of the interesting palaeoe- 
cological questions concern differences in extinction 
rates in particular intervals. We also have ample 
evidence that sampling varies over time (e.g. Peters 
2006; McGowan & Smith 2008), which might affect 
our estimates of extinction. 

19.2.4 Relevant studies 

Foote (2001c, 2003) modifies the approach outlined 
above to estimate interval-by-interval rates of /r, A 
and R. One applies variations of equation 19.5 on 
'birth' and 'death' cohorts, that is, how long do taxa 
first appearing in an interval last and how old are 
the taxa that last appear in an interval? In other 


words, instead of analysing a vector of ages at last 
appearance, one now analyses a matrix X where 
each cell X, ; gives the number of taxa first appear¬ 
ing in tj and last appearing in t, . Now, one can apply 
separate m and R, (and A, ) at each f, on variations of 
equation 19.5. The expected number of taxa origi¬ 
nating in f, with d = j now is affected by variation in 
fi from i to ;, and the probability of any given sr < d 
is affected by variation in R from i to j. Because the 
best R, for any f, must accommodate both the back¬ 
wards survivorship curves as well as the forwards 
survivorship curve. A, is relevant for estimating i±,. 
The algorithms to find the collection of parameters 
maximizing the probability of the observed matrix 
are computationally intensive. Foote's (2003) analy¬ 
sis of Sepkoski's (2002) compendium of marine gen¬ 
era shows, for example, that the major loss of taxa 
at the Big Five mass extinction events (Raup & Sep- 
koski 1982) and the high turnover of the Cambrian 
taxa (Sepkoski 1979) require high rates of extinction 
even in those cases where preservation rates were 
low. 

Finally, estimates of extinction and preservation 
from equation 19.5 effectively assume that taxa 
extant are present throughout all intervals within 
their durations. This can be true only if both orig¬ 
ination and extinction are pulsed, that is, concen¬ 
trated in some particular interval. However, if they 
are continuous, then true durations will span frac¬ 
tions of the first and last intervals for most taxa. 
This has two effects (Foote 2005). One, if origina¬ 
tion probabilities are uniform throughout intervals, 
then there is a 70% chance that a taxon with a true 
duration of 2.7 intervals exists in all or part of four 
intervals. Two, the probability of sampling a taxon 
in its first or last interval is not R, but 1 - (1 — R) x , 
where x is the fraction of the first/last interval in 
which the taxon occurs. Given this subtle differ¬ 
ence, Foote (2005) found strong support for pulsed 
extinctions and origination in most intervals. One 
implication of this is that the tradition of standard¬ 
izing extinction rates by millions of years often is 
highly misleading. 

19.2.5 Occurrence-based diversity estimates 

We require approaches such as outlined in the 
prior section when synoptic ranges are our only 
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Figure 19.4 Hypothetical distribution of finds in a section, based 
on a single simulation. 'Horizons' represent fossiliferous beds. Black 
circles indicate that the species is sampled. White circles indicate that 
it is unsampled. Hatched circles represent the first fossiliferous 
horizon following the extinction event. Gray ellipses give the observed 
stratigraphic ranges. Although all 12 species are extant throughout 
the time represented, they appear to go extinct (and originate) 
gradually. 


data. However, in many cases we have data con¬ 
cerning the distributions of finds within strati¬ 
graphic ranges, such as the Paleobiology Data¬ 
base (http://pbdb.org; see, for example, Alroy et al. 
(2001)). This provides us with an independent 
means of estimating R, using range-through taxa 
for any t,. In Fig. 19.4, four taxa are known to be 
extant during ti and f 3 because they were sampled 
before and after those intervals. This leads to a sim¬ 
ple estimate (Connolly & Miller 2001b): 

R, = (19.6) 

^i.az 

where S iaz is the number of range-through taxa for 
f, (see Fig. 19.1) and S, 0 b S is the subset of those taxa 
that we actually observe. If one uses empirical esti¬ 
mates of Rj instead of solving for R, at the same 
time as m (and A;), then it becomes much easier to 
solve for the diversity parameters. 

Foote (2007a) uses the Paleobiology Database to 
measure R with range-through taxa and then assess 
the effect of preservation on recalibrated extinc¬ 
tion rates. This suggested that extinction is less 
continuous through the Phanerozoic than previ¬ 
ously thought, as immediate R, explains the last 
appearances in many low extinction intervals. A 
corresponding result is that extinction rates of sub¬ 
sequent intervals increase, with the end result being 
that intervals with high extinction given 'face value' 


data have even higher extinction after optimizing 
for R. In particular, Foote's analysis suggested that 
extinction rates in the intervals preceding the end- 
Permian, end-Triassic, end-Ordovician, and end- 
Devonian were lower than suggested by the 'face 
value' fossil record. Thus, accommodating preser¬ 
vation elevates rather than erases mass extinction 
episodes. Nevertheless, and as Foote emphasizes in 
that work, the 'face value' data leads to the same 
conclusions, for example estimates of discontinu¬ 
ity of extinction rates separating a few intervals of 
mass extinctions from other intervals remains (see 
Wang 2003). 

Connolly and Miller (2001b) use the distribu¬ 
tion of genus finds over time to estimate preserva¬ 
tion, extinction, and origination likelihoods using 
a capture-mark-recapture (CMR) approach. This 
represents a modification of the CMR methods used 
to estimate the same parameters for Recent taxa 
given conservation data (Pradel 1996). One obvi¬ 
ous advantage of this sort of approach is that one 
can estimate diversity parameters for separate envi¬ 
ronments or biogeographic emits from the same 
stages (see, for example. Miller & Foote (1996) 
and Connolly & Miller (2001a)). Alroy (2000) uses 
likelihood estimates of these parameters to con¬ 
struct faunal ordinations that maximize the prob¬ 
ability of observed species lists. The further devel¬ 
opment of such techniques in conjunction with the 
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expansion of large occurrence-based databases (e g. 
the PBDB) represents an important future direction 
for palaeobiological studies (see Prospectus section 
below). 

19.2.6 Gap analyses 

The use of exact numbers of finds within strati¬ 
graphic ranges and the issue of continuous ver¬ 
sus pulsed extinction rates points the way to the 
most exact methods for evaluating the timing of 
extinctions. This is perhaps best known in associ¬ 
ation with Alvarez et al.'s (1980) proposal that the 
end-Cretaceous extinction is a sudden event trig¬ 
gered by a major asteroid impact. However, the 
idea that extinction events are pulsed rather than 
gradual is not new (see, for example, Erwin (2006)). 
Some theory suggests that lower-level turnovers 
often are pulsed (Vrba 1985; Morris et al. 1995) 
and many of our concepts about stratigraphic zones 
tacitly assume pulsed turnovers (e.g. Shaw 1964). 
Foote's (2005) study argues against 'gradual' extinc¬ 
tion mechanisms that distribute extinctions through 
entire stages. However, chronostratigraphic stages 
and substages typically are millions of years long 
whereas the 'gradual' extinction hypotheses often 
predict extinction over hundreds of thousands of 
years. Thus, although the 'pulsed' hypotheses of 
extinctions predict extinctions over tens of thou¬ 
sands of years or less, both 'pulsed' and 'gradual' 
might make the same predictions about synoptic 
data. We therefore must focus on the distributions 
of finds within chronostratigraphic units to find 
different predictions between gradual and pulsed 
extinction hypotheses. 

Preservation again is very important when ascer¬ 
taining the expectations of pulsed and gradual 
extinction hypotheses. The probabilistic nature of 
preservation and sampling means that we expect 
gradual disappearances of taxa from the fossil 
record even if all of the taxa share the same time 
of extinction (Signor & Lipps 1982). Simple simu¬ 
lations illustrate the Signor-Lipps effect (Fig. 19.4). 
Consider the first and last occurrences of 12 taxa 
over 21 fossiliferous beds (horizons), with the 12 
taxa forming a Zipf abundance distribution (y = 2.0) 
and having equal preservation probability per spec¬ 
imen. Although all 12 taxa are present throughout 


the section, they appear to phase out gradually sim¬ 
ply because of sampling failure: although the com¬ 
mon taxa are sampled up to the top of the section, 
the less common taxa have highly truncated ranges. 
Solow (1993a) notes a similar problem for conser¬ 
vation biologists: it is easy to miss rare species for 
5+ years by chance alone, so failing to sample one 
for a long period of time is weak evidence that it is 
extinct; conversely, it is difficult to miss a common 
species for any length of time, so failing to sample 
one is strong evidence that it is extinct. 

One advantage that palaeontologists have that 
conservation biologists do not is that they can con¬ 
tinue to sample all of the time planes. Ultimately, 
elevated sampling should reveal the rare taxa. 
However, this is too labour intensive and expen¬ 
sive to be done at many sites. Thus, we need solu¬ 
tions using the available data. This, in turn, leads 
to gap analyses, which attempt to use estimates of 
expected gaps between samples to evaluate both 
extinction and the Signor-Lipps effect. 

Strauss and Sadler (1989; see also Marshall 
(1990)) use a Dirichlet distribution to assess possible 
gaps between last appearance and true extinction at 
any given confidence interval (Cl) as: 

g = sr x ([1—Ci]-J& -l) (19.7) 

where g is the hypothesized gap, sr is the observed 
stratigraphic range, H is the number of horizons, 
and Ci is the Cl on one end of the stratigraphic 
range. Note that if we use continuous meters (e.g. 
Strauss & Sadler 1989) or time (e.g. Marshall 1990), 
then the stratigraphic range would be the last hori¬ 
zon minus the first horizon and single samples 
have no range; however, with binned horizons, the 
minimum range is 1. We use horizons here simply 
because the non-random distribution of horizons 
within units of strata or time require further mod¬ 
ifications to these approaches (Marshall 1994,1997; 
but see Prospectus section below). Thus, for taxon 
E,g 5 o% is 18 horizons x([l - 0 . 5 ] - z=r - l) = 2.2 hori¬ 
zons beyond the last appearance. In other words, 
there is a 50% chance that the true extinction hap¬ 
pened between the last appearance and this point 
and a 50% chance that the true extinction happened 
after this point. 
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Marshall (1995) proposes using 50% confidence 
intervals to test the null hypothesis of simultaneous 
extinction. Marshall uses the range that is above 
g 50 % for half of the species as the optimal extinction 
zone. In Fig. 19.4, half of the taxa have £ 50 % < 2.2 
and half have # 50 % >3.1. Thus, the 22nd horizon 
would be the optimum extinction horizon. As that 
follows the last appearance of all of the taxa, we 
cannot reject a single extinction. Using metres of 
sediment rather than fossiliferous horizons, Mar¬ 
shall (1995) shows that Cretaceous ammonoids not 
only fail to reject the simultaneous extinctions, but 
that the £ 50 % values do an excellent job of predict¬ 
ing the iridium layer terminating the Cretaceous. 
Conversely, Marshall and Ward (1996) demonstrate 
that the 50% line for Late Cretaceous inoceramid 
bivalves precedes the last appearance of nearly half 
the species. As many species clearly existed after the 
optimal extinction time for half the species, there 
almost certainly are multiple extinction events for 
inoceramids. 

Marshall's basic approach lends itself well to like¬ 
lihood tests of numbers of extinction events. Build¬ 
ing on a model initially presented by Solow (1996), 
Wang and Everson (2007) consider the likelihood of 
the hypothesized extinctions, Ci...s, of S taxa, noting 
that: 

S 

i[C: -isr, Hu.s] ex ( g _ ^ ; [ ) ‘ (19.8a) 

where F A is the first appearance of taxon i, and 
therefore: 

— F A + 1 = gi + sr, (19.8b) 

(i.e. the observed stratigraphic range, sr, plus the 
gap, g, implicit to the hypothesized extinction 6 ). 
Again, the '+1' is necessary only when stratigraphic 
ranges are in discrete rather than continuous units. 
Note also that the likelihood is 0 rather than a nega¬ 
tive number if g < 0 as this is an impossible hypoth¬ 
esis. For any taxon z, the likelihood decreases as 
gi increases. The number of parameters is equal to 
the number of different extinction events. Thus, the 
simplest (minimum varying parameter) hypothesis 
has all taxa sharing the same extinction events and 
the most likely single-parameter (single-extinction) 
hypothesis sets all C, to the latest last appearance 


(e.g. horizon 20 in Fig. 19.4). Conversely, the most 
complex possible hypothesis posits a separate 
extinction for each taxon. The single most likely 
version of this basic model is one in which gi = 
0 for all taxa, and thus can have as many differ¬ 
ing 6 values (and thus as many different parame¬ 
ters) as there are taxa. The support (log-likelihood 
rescaled to the maximum) for this hypothesis is 0. 
There are a large number of alternatives of inter¬ 
mediate complexity, for example one can examine a 
two-parameter hypothesis that posits two separate 
extinctions giving Ca .f at horizon 20 and Cg l at 
horizon 19. 

Log-likelihood ratio tests (e.g. Edwards 1992) 
now can assess whether the more complex hypoth¬ 
esis predicts the data significantly better than 
the simpler hypothesis. In Fig. 19.4, the sup¬ 
port (i.e. the log-likelihood rescaled to the most 
likely hypothesis) for 6 a l = —7.34. Whether we 
can reject this in favour of continuous extinction 
depends on how many parameters we ascribe 
to the best 'continuous' hypothesis. Wang and 
Everson (2007) use S on the grounds that there 
is one extinction for each of the S taxa. How¬ 
ever, in this example there are only seven last 
appearance beds. This means that we need only 
seven extinction events to eliminate all 12 taxa, 
and that the degrees of freedom should be 6 
(7 — 1) rather than 11. This is important because 
we would not reject the null hypothesis of a sin¬ 
gle extinction if we assume that the overall most 
likely hypothesis requires 12 parameters (p = 0.20) 
whereas we would if we assume that the over¬ 
all most-likely hypothesis requires seven parame¬ 
ters (p = 0.02). However, a two-parameter hypoth¬ 
esis (Ca...c=20, £d ...l = 19) represents a signifi¬ 
cant improvement over the one-parameter hypoth¬ 
esis (support = —4.45; p = 0.02) while withstanding 
rejection from the overall most likely hypothesis 
given the assumption of either 12 parameters (p = 
0.54) or seven parameters (p = 0.11) or from the 
most-likely three-parameter hypothesis (C A ...c = 20, 
Cd...f = 19, Cg...l = 18; support =-3.13, p = 0.10). 
This clearly is erroneous given that all 12 taxa share 
the same time of extinction in the simulations, but 
restricting all 12 extinctions to the final two beds 
strongly contradicts the idea of a prolonged extinc¬ 
tion event. 
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Information theory criteria give an alternative 
means for assessing the hypothesis. An appealing 
aspect is that these criteria can lead one to conclude 
that the simpler hypothesis is superior to the more 
complex hypothesis rather than simply unrejected 
(Burnham & Anderson 2002). Here, Akaike's infor¬ 
mation criteria (AIC; Akaike 1973), modified for 
sample size (Burnham & Anderson 2002), is: 

AICc = -2 InL[C a ...l] + 2R 

where K is the number of parameters and 
the taxon richness, S = 12, is the number of 
data points. Because the AICc[Ca...l = 20] = 17.1 
whereas AICc[7 separate C] = 42, we now prefer 
the single extinction to the mostly-likely seven- 
extinction hypothesis. However, we still prefer 
the best two-parameter hypothesis (AICc[Ca ...c = 
20, (d ..l = 19] = 14.2) to the best single-extinction 
hypothesis. On one hand, one might fault the 
method for failing to corroborate the correct 
hypothesis that all of the extinctions happened after 
the last bed. However, it is reassuring that the 
favoured hypothesis is a near miss that concentrates 
all of the extinctions in the last two beds. 

Huelsenbeck and Rannala (1997) estimate the 
probability of zero finds over some interval given 
preservation rate, R. Although this is intuitively 
reasonable, R itself is an unknown. R is not explic¬ 
itly addressed in the Wang and Everson approach, 
but uncertainty about R should be a concern there 
too. An intuitive estimate of Ris simply: 

R = - (19.9a) 

sr 

As Fig. 19.4 illustrates, this is biased towards overe¬ 
stimating preservation rates because the denomina¬ 
tor under-estimates the true duration (Wagner 2000; 
Foote 2001a). Instead, an unbiased estimate is given 
by the finds within the known range, i.e.: 

R = (19.9b) 

sr — 2 

Even so, we should not estimate the likelihood of 
hypothesized extinction as: 

(l — 


First, the uncertainty around the most likely R can 
be very large, especially for taxa sampled from a 
few beds over a short duration. Although the prob¬ 
ability of the stratigraphic data is maximized with 
equation 19.7b, the joint probability of the observed 
data and the hypothesized gap (and thus the joint 
likelihood of £ and R; Wagner 2000) is maximized 
at: 


giving the maximum: 

max L[C, R|sr, H] oc (1 - R) s x ([1 - R] sr “ H x R^ 1 ) 

(19.11a) 

a (1 - Ry* sr ~ H x R H ~ 1 (19.11b) 

where the first term in equation 19.9a is the proba¬ 
bility of zero finds over the gap and the second term 
is the sufficient statistic from the binomial probabil¬ 
ity of H — 1 finds over the observed stratigraphic 
range after the first find. (The binomial combina¬ 
toric is constant for all hypotheses.) Note that we 
now subtract only one horizon as one end of the 
'true' duration is implicit to the hypothesis and we 
truncate only the base. 

We now can estimate the likelihood of one or 
more extinction events using the product of equa¬ 
tion 19.11 for all species with H >2. Returning to 
our example in Fig. 19.4, the log-likelihood of a 
single extinction at horizon 20 for the 10 species 
with Hj > 2 is -18.70. The log-likelihood of each 
species going extinct at its last appearance increases 
only to —14.65, which is insufficient to reject the 
single C hypothesis even when assuming only 
seven-parameters ( p = 0.23) and which fares far 
worse given information theory (AICc[Ca ... l = 20] = 
39.8 vs AICc[Ca...l = LA a ...l] = 71.3). However, a 
hypothesis of extinctions at horizons 19 and 20 
still is preferable (lnL = 16.25) given either log- 
likelihood ratio tests (p = 0.03) or AIC (AICc[C a .c = 
20, C D ... L = 19] = 37.8). 

The potential advantage of this approach over 
the approach of Solow/Wang and Everson is that 
it allows one to assess whether preservation rates 
change within a stratigraphic range. If so, then 
change in R can be explicitly accommodated (see 
Solow 1993b; Wagner 2000). For example, a sim- 
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pie test of linearly increasing/decreasing R over 
horizons would simply test the exact sequence of 
finds and misses with changing probabilities of 
finds and misses. Alternatively, one might posit 
a sudden shift in preservation rate. If the fre¬ 
quency of misses increases/decreases, then the 
gap between last appearance and extinction will 
increase/decrease. For example, taxon I in Fig. 19.4 
shows an increasing recovery rate over time: it is 
found once over 10 horizons (2-11) then twice over 
the next five horizons (12-16). The most likely sin¬ 
gle R, = 0.2 (equation 19.7b) has a support of 
-7.51(= ln[0.2 3 x 0.8 12 ]). The best gradually shift¬ 
ing R, with R 2 = 0.05 —> R u = 0.34, increases the 
support only to —7.14. The best two-phase R, 
R 2 ...U = 0.1 and R12...16 = 0.4, increases the support 
only to —6.86. In neither case would we reject the 
idea that sampling was homogeneous throughout 
taxon I's range and thus we have no particular 
reason to distrust the likelihood given sampling 
over those 20 horizons. However, Wagner (2000) 
presents empirical examples where this approach 
does reject homogeneous R and thus leads to dif¬ 
ferent likelihoods on hypothesized extinctions. 

19.3 Prospectus 

The study of extinction can proceed in many direc¬ 
tions, many of which workers have tentatively 
explored already. In particular, the use of databases 
such as the PBDB to ascertain more exact tests 
of extinction pacing and preservation rates within 
taxa represents the logical next steps to the meth¬ 
ods described above. For example, using observed 
range-through taxa to estimate R provides an aver¬ 
age sampling intensity within an interval. This 
obviously will not represent all genera. With higher 
taxa in particular, species richness within a genus or 
family will strongly affect preservation rate, as will 
other factors, such as the geographic distributions 
of the species within the taxa. Foote (2007b) shows 
that the occurrences of genera typically follow a bell 
curve over time, that is, occurrences are low both 
in the early and late intervals of a genus' range 
and typically peak in the middle. Because genera 
originate throughout time, this means that different 
genera will be at different points in their bell curves 
in any given interval. Moreover, the peaks of these 


distributions (i.e. the maximum numbers of occur¬ 
rences) differ among genera. These two factors com¬ 
bined means that it should be possible to derive a 
distribution of preservation intensities within each 
interval rather than a single R 'per genus'. What 
is most intriguing is that we might use the bell 
curves themselves in a Bayesian context to assess 
extinction risk (see, for example, Solow (1993b)). In 
other words, extinction risk should be relatively low 
at the peak of an occurrence distribution, when a 
genus typically is widespread and fairly speciose 
(see, for example, Foote et al. (2008)). Conversely, 
the extinction risk should be relatively high near 
the tail. Whether or not we wish to assume normal 
curves for occurrences over time requires further 
testing; however, data such as the PBDB make it 
possible to assess whether 'commonness' of genera 
is linked to extinction risk. 

Part of the reason why we should expect common 
genera to be less extinction prone than uncom¬ 
mon genera is that we expect geographically wide- 
ranging genera to be more common in the fos¬ 
sil record than contemporaneous geographically 
narrow-ranging genera (e.g. Jablonski 2000). How¬ 
ever, because sampling is not randomly distributed 
over geography, we cannot simply equate occur¬ 
rences with geographic range. Instead, we need 
to actually examine occurrences in particular geo¬ 
graphic units (see, for example, Foote et al. (2008)). 
Alroy (in Alroy et al. (2008)) presents methods for 
accommodating geographic dispersal in sampling 
standardized estimates of richness over time. We 
should adopt similar approaches for summarizing 
geographic distributions when examining the pre¬ 
dictions of alternative extinction models. 

Even occurrences within temporal and geo¬ 
graphic units obscure important information about 
abundances within localities. In general, there is a 
good correlation between abundances and occur¬ 
rences, in that taxa that are abundant anywhere 
within their geographic range tend to have many 
occurrences (Buzas et al. 1982; Alroy et al. 2008). 
However, in gap analyses of particular sections, 
variation in abundances of specimens has an impor¬ 
tant effect on the plausibility of missing an extant 
species (see, for example, Hurlbert (1971)). The sim¬ 
ulations sample equal numbers of 'specimens' per 
bed. However, in reality, all beds are not equal: due 
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to access, taphonomy, weathering, etc., some beds 
in the same section will yield many more speci¬ 
mens than others. Weiss and Marshall (1999) explic¬ 
itly accommodate this when examining extinction 
boundaries for late Cambrian trilobites. Similarly, 
McElwain et al. (2009) examine the end-Triassic 
extinction in plants not in loss of richness, but in 
shifts on most likely relative abundance distribu¬ 
tions. A combination of the two approaches would 
assess the likelihoods of hypothesized abundances 
over time/strata, with extinction being an abun¬ 
dance of 0. The logic of shifting R over a section 
would apply here, except that we now would ask 
whether we could sample X specimens without 
finding a species given that it is known from n of 
N specimens prior to that. 

Another promising avenue of research involves 
the use of ordination techniques to describe 
fossil turnover without assuming a particular 
chronostratigraphic scale (e.g. Alroy 1992, 
1994, 2000; Sadler and Cooper 2003). Although 
presented as techniques for repeatable, high- 
resolution chronostratigraphy, these methods also 
describe fossil distributions in the same manner 
needed for confidence interval studies, but at 
the geographic and geologic scales of synoptic 
studies. 

Finally, one might ask whether gap analyses 
of extinction events frame the basic hypotheses 
properly. As done above, the 'complexity' of the 
hypotheses is the number of extinction events. A 
single-extinction event thus is the null because it is 
only one event. Statistically, this is also appropriate 
because a single event necessarily is less likely than 
(as in Fig. 19.4) seven events, as it maximizes the 
minimum number of necessary gaps. However, this 


is not quite the appropriate null hypothesis if we 
look at extinction rates. A single event requires two 
rates: a 'background' rate of no extinction and a 
single pulsed rate of (nearly) 100% extinction. Thus, 
we can ask, what is the probability of X extinctions 
from Y remaining taxa given a hypothesized extinc¬ 
tion rate per bed? As Weiss and Marshall (1999) 
note, the extinction rate provides a prior probability 
of a lineage existing 1, 2, 3, etc., beds after its last 
find. This multiplied by likelihood of the gap gives 
a posterior probability of the taxon existing 1, 2, 3, 
etc., beds after its last finds. The sum of these poste¬ 
rior probabilities is what we would use as the count 
of the number of extinctions per bed. For example, 
in Fig. 19.4, we would have numerous extinctions 
in the last two beds given a hypothesis of uni¬ 
form extinction rates and the sampling data. This 
would be strong evidence against a gradual hypoth¬ 
esis. However, if last appearances were distributed 
more evenly throughout the section, then it would 
be much more difficult to extend the durations to 
the top of the section as not only would they 
become increasingly improbable, but the plausibil¬ 
ity of the taxon still being extant also would become 
increasingly improbable. 

19.4 Key points 

1. The fossil record provides data capable of testing 
a wide variety of extinction hypotheses. 

2. This can be done at very broad temporal scales 
as well as over intervals closer to 'ecological 
time'. 

3. Accommodating preservation is critical at all 
temporal scopes of palaeontological analyses of 
extinction. 



CHAPTER 20 


Estimating species density 

Michael L. Rosenzweig, John Donoghue II, Yue M. Li, 
and Chi Yuan 1 


20.1 Introduction 

20.1.1 The problem: what is the density 
of species? 

In 1969, students of biodiversity were astonished by 
the conclusion that the highest plant species diversi¬ 
ties in the world were not in tropical rainforests, but 
in the evergreen heathlands of Australia and South 
Africa (Richards 1969). This conclusion was based 
on estimates of species density obtained by taking 
total number of species, S, and dividing by total 
area, A. However, the conclusion was reversed by 
Parsons and Cameron (1974), who realized that the 
right way to compare diversities was to compare 
species-area curves: the system with the higher 
species-area curve is the more diverse—at any par¬ 
ticular area, it has more species. In contrast, as the 
reader will see below, species densities obtained by 
simply dividing the number of species by area can 
be extremely misleading. 

Yet, despite the high profile of the plant diver¬ 
sity discussion (above and repeated in Rosenzweig 
1995), we still find ourselves reading important 
papers that estimate species densities with the same 
faulty, linear method of division. For example, Stein 
et al. (2008) produced a valuable analysis of all the 
public lands in the USA. One of their goals was 
to compare the concentrations of species of con¬ 
cern in different types of public land. They con¬ 
cluded that lands belonging to the US Department 
of Defense (DoD) bear far more of the burden for 
species conservation than any other type of public 
land, including US national parks and forests. Much 
of the impact of that conclusion comes from a stun¬ 
ning bar graph (their Fig. 3) whose y-axis is species 

1 Equal participants listed in random order. 


density. DoD lands appear to have roughly three 
times the density of species of concern compared 
to lands of the US National Park Service (which are 
in second place). However, the species densities of 
that bar graph come from dividing the number of 
species by the amount of land, and they mislead 
badly. 

Indeed, the DoD does bear a heavy respon¬ 
sibility for conservation of the nation's species 
diversity. Stein et al. show that about 23% of 
species of concern reside on DoD land. The DoD 
must shoulder its responsibility in environments 
where it is extremely difficult to avoid habitat 
disruption (owing to target practice, tank train¬ 
ing, etc.), but, proportionally, its responsibility is 
not nearly as close to overwhelming as the bar 
graph suggests because the DoD manages only 
12.1 x 10 6 ha whereas the Park Service manages 
33.8 x 10 6 ha and the Forest Service 78.1 x 10 6 ha. 
Take any number of species, S, and divide it by 
those areas; it will seem the DoD has 2.8 times the 
Park Service's job and 6.5 times that of the Forest 
Service. 

20.1.2 Defining the density of species 

How can we get it right? How can we define species 
density so that our estimates of it will permit mean¬ 
ingful comparisons? The answer lies in using the 
very non-linear relationship SPAR that causes the 
problem to begin with. 

SPAR stands for SPecies-ARea relationship: As 
area increases, so does the number of species found 
in the area. Ecologists have investigated this rela¬ 
tionship for more than two centuries. It is so well 
established and so general that ecologists have 
called it a law (Lawton 1999b). 
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A SPAR is usually reported as a straight line in 
logarithmic space: 

log S = c + z log A 

where A is the area, S is the number of species, 
and the other two parameters are coefficients of the 
regression. Several exciting contributions have now 
demonstrated mathematically that SPARs, although 
they come close to being straight lines in logarith¬ 
mic space, do in fact have some curvature (Leitner 
& Rosenzweig 1997; Allen & White 2003; McGill & 
Collins 2003). Nonetheless, they are straight enough 
to justify our continuing to approximate SPARs 
with straight lines and to capitalize on the benefits 
of that approximation. One of these is that because 
the logarithmic version of the SPAR is linear, we can 
use it to measure species density. 

Consider the fact that a density or concentration 
of something is a quantity that stays the same no 
matter how much area or volume is surveyed. 
Thus, a population density of three gerbils per 
hectare means that 10 ha will have 30 gerbils, 100 ha 
will have 300 gerbils, 1000 ha will have 3000 gerbils, 
etc. One cannot do this for species density, however, 
without taking into consideration the fact that 
SPARs are strongly curved in arithmetic space. Sup¬ 
pose, for example, that the number of beetle species 


fits the formula log S = 1.3 + 0.1771og A Then 10 ha 
will have 30 beetle species, 100 ha will have 45 
beetle species, 1000 ha will have 68 species, etc. If 
we were to calculate the density of beetle species 
using simple division, S/A, we would obtain 
densities of 3, 0.45, and 0.068, respectively. Yet all 
of these results would have come from a taxon 
whose species diversity follows one single, simple 
equation, i.e. logS = 1.3 + 0.177 log A, so all three 
actually have the same species density! Fig. 20.1 
shows a graphical example of the misleading effects 
of using division to calculate species densities. 

Now we use the SPAR itself to recognize the 
quantitative property that stays constant in differ¬ 
ent amounts of area. That property is the coefficient 
c. To make this property most closely resemble an 
arithmetic density, we transform it to its antilog 
(being careful to use a single base of logarithms for 
all operations—we use 10). Now we can define a 
useful species density, C: 

C = 10 c = S/A 

The two coefficients of SPARs, C and z, are very 
different. The value z reflects the scale of the SPAR 
(Rosenzweig 2003). Values of z that come from com¬ 
parisons of biogeographical provinces exceed 0.55 
and often return numbers close to unity. Values of 



Figure 20.1 Comparison of two hypothetical samples. Both were generated by a SPAR with z of 0.20. The lower one, with C = 8, was sampled at 
50 units of area and yielded 17.5 species. The upper one, with C = 12, was sampled at 150 units of area and yielded 32.7 species. Because SPARs are 
curvilinear, raw density, 5/A, varies with the value of A at which one takes the sample. In this case, SIA is 0.350 and 0.218, respectively, that is, it is 
greater for the lower curve! SIA reveals nothing of the underlying biological properties of the samples. In contrast, the species density, 5/A\ is 
independent of A. In these two cases, it is 17.5/5O 0 - 20 (= 8.0), and 32.7/150°- 20 (= 12.0). 
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z from different sized areas of a single region are 
generally less than 0.23 and usually more than 0.12. 
Values from archipelagoes exceed the latter range 
but are less than those of provinces. This difference 
among z values makes theoretical sense and reflects 
the different biological processes at work at the dif¬ 
ferent scales. 

Other than varying with scale however, z values 
vary little. Perhaps we could use a convention to 
estimate C? Perhaps we should simply set all z val¬ 
ues to 0.2, then estimate species density as S/A 02 ? 
This already would have been a great improvement, 
but herein we shall try to do better than that by 
obtaining a data-based estimate of z. 

20.1.3 Species density takes on new 
importance in an era of environmental concern 

The importance of being able to make meaning¬ 
ful comparisons between species densities grew 
with the recommendation of the National Research 
Council (2000) (NRC) to rely on species densities 
for some basic environmental indicators. The NRC 
wanted these indicators to give policymakers a 
thumbnail sketch of the environmental state of the 
USA. The subtext is one of concern. Many fear our 
environment is spiralling out of control and that 
without transparent, simple, and objective indica¬ 
tors, policymakers will be hard-pressed to do any¬ 
thing about it. 

The NRC did not endorse the erroneous estima¬ 
tion of species density by linear division. Instead, 
it adopted a definition, C, based on SPARs and 
suggested by one of us (Rosenzweig) and restated 
above: 

C = S/A 2 


The NRC recommendation was expanded by 
Rosenzweig (2004). 

Neither publication, however, laid out a practi¬ 
cal method for estimating species densities. In this 
chapter, you will come to appreciate that estimat¬ 
ing C is far from trivial. We shall use a large plant 
dataset taken in Virginia to reveal the challenges 
of estimating species densities from field data. We 
shall also explore ways to circumvent the problems 
and arrive at reliable estimates, and we shall test 
our result. 

We begin with a description of the dataset and 
a first analysis that we conduct as if there were 
no challenges. Then we advance to recognize and 
confront them. 

20.2 Data set 

20.2.1 Data description 

Our data come from the Virginia Natural Her¬ 
itage Program (VNHP), which aims to save Vir¬ 
ginia's native plant, animal life, and ecosystems. We 
analysed its plant data only (Virginia Natural Her¬ 
itage Program 2006; DCR-DNH vegetation plots 
database, ver. 3.0. Virginia Department of Conser¬ 
vation and Recreation, Division of Natural Her¬ 
itage. Richmond). The plant data set has three parts 
(see Table 20.1): 

• A list of all 4205 plant species ever recorded in Vir¬ 
ginia. Each species listing includes its Latin name, 
common name, family name, global element rank, 
state element rank, and existing status. 

• Details of 652 study plots sampled by VNHP. 
Details include plot area, latitude, longitude, ele¬ 
vation, slope, ecoregion, ecological community 


Table 20.1 The Virginia Natural Heritage Program's plots have six different areas and belong to seven 
different ecoregions. 


Area (m 2 ) 

Number of plots 

1000 

50 

800 400 300 ; 

1 588 2 

200 100 

5 6 

Ecoregion 

Number of plots 

Ecoregion 

Number of plots 

Unknown 

1 

Cumberland/Southern Ridge &Valley 

17 

Central Appalachian Forest 

444 

Chesapeake Bay Lowlands 

10 

Piedmont 

155 

Lower New England/Northern Piedmont 

8 

Southern Blue Ridge 

13 

Mid-Atlantic Coastal Plain 

4 
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Box 20.1 WS2M 


Estimates is far and away the most widely used software 
for reducing sample size bias when estimating diversity. 
However, we used our own software, WS2M, for our 
analyses. WS2M is based on Estimates, yet, running on a 
Windows platform, it provides some features not available 
in Estimates. These include estimates based on several 
extrapolation formulae—which once again have proved 
their usefulness in our studies—as well as the version 
of the Michaelis-Menten extrapolator (MMf) that is fit by 
non-linear regression (rather than by maximum likelihood). 
WS2M computes jack-knife estimates based on all 
five jack-knife orders as specified in Burnham and Overton 
(1979). It also allows for analysis of presence-absence data. 

WS2M requires one of three pure ASCII input file 
types. In this work, we used a matrix input: each species is a 
column; each plot is a row. Each entry is the abundance of a 
particular species in a particular plot. Each entry is separated 
from the next by a space. The matrix is preceded by two 
ASCII lines, the first is any set of ASCII characters; we usually 
use it for a matrix name and to specify special goals for that 
matrix. The second is a pair of numbers specifying the matrix 
size, with number of columns (species) first. For example: 
Butterflies of Willow Slough 

34 6 

1 1 42 53 44 9 1 837 

41 759222 50 13 1 1 1 11 14 55200000000 
19 6 0 188 1263607 5 1 

23 29 43 50 3 3 257 26 9 0 5 5 9 5 24 10 1 2 2 3 0 0 0 0 

2 0 044474 18 0 167 1 0 

98 19 21 29 552 30 12079 13 15 196 1 00 1 0000 
6 00 175 293 14 0 16 

400 20 12 6 24 20 50 7153302 25 4220111000 
27 0 0 52 743 31 1 363 1 0 10 7 11 
25 4 11 1041 183821 18 12 19 11 153 1 1 0 3 0000 
400 128 146 38 0 18 

6 1 97 134 19 1 132 2 11 1 3 0631 45300000000 
The example above has the abundances 
of 34 species in six plots. (In this case a plot happens 
to be a year.) A second example shows the same records 
converted to incidence data (i.e. presence or absence): 
Butterflies of Willow Slough (presence or absence) 

34 6 

11111111111111111111111111000 

00000 

11011101111111111110111111111 
1 0000 


10011101101111111110111111100 

10000 

10011101100111111111101111101 

11000 

10011111101111111111111111110 
1 0000 

10011101111111111111101111000 

00000 

You must give all matrix files the 'prn' file extension and 
place them in the same folder as the program. WS2M 
depends on you to do those things. 

WS2M also does not know whether you are inputting an 
abundance or an incidence file. You must tell it by setting 
certain flags after you load your data matrix: 

• On the Actions tab, check 'Shuffle Sample Order' and 
'Shuffle Individuals'. 

• Then check 'Make Exchangeable’ and UNcheck 'Shuffle 
Individuals'. 

• On the Advanced Options tab, check 'Shuffle 
Incidences Only'. 

The few settings above are those required for the analysis 
of presence-absence data. Other settings are explained in 
the user manual available online at http://eebweb.arizona 
.edu/diversity/, where you will also find the program itself to 
download (ws2mb.exe). 

WS2M is research software. It is not commercial and not 
for the blithe. WS2IV1 will automatically warn you of some 
mistakes and automatically correct others, but not many. It 
relies on your professionalism, especially your willingness to 
decide in advance what you really want to do. It will not 
replace a magic wand. 

WS2M produces an output file that will allow you to 
design stunning presentations with your own favourite 
graphics package, but the graphs that it generates 
internally may not seem so useful. (One of the authors is 
colour-blind and the other two got tired of resetting the 
output colours of the software to compensate!) Also, the 
software exploits Windows at a very low level. That makes 
it very powerful and fast, but it also means that it inevitably 
runs into one of those infamous unfixed Windows bugs: 
when you dose the program Windows will tell you of an 
error. Ignore the warning. It's a Windows colour bug and 
has no effect on the output values. 

And, yes, WS2M runs under Windows Vista, too, at least 
on the 32-bit versions. 
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group, and detailed soil characteristic (pH, N, S, 
P, etc.). In all, 1001 plant species were recorded 
in the 652 plots. Each plot had 11.3 species on 
average. 

• A list of species in each plot and each species 
coverage score in each plot where it was recorded. 
Coverage score is a measure of abundance (which 
we discuss below), but no other measure of abun¬ 
dance is provided. Some species, 3402 to be pre¬ 
cise, were not recorded in any plot, but of the 1001 
species that were recorded, each species occurred 
in 7.4 plots on average. 

20.2.2 Data manipulation 

All data were in Microsoft Access, so we wrote 
code to build tables from which we could esti¬ 
mate SPAR. (All code for this chapter is available 
at evolutionary-ecology.com/data/speciesdensity.) 

One function of the code was to eliminate the 
coverage variable. All coverage values are integers 
from 0 to 9. They are not abundances. So the code 
converted all values greater than 1 into 1. Thus the 
data sets became presence-absence matrices. 

20.2.3 NP : our surrogate for A 

If we had abundance data, we could use them as a 
surrogate for area. The x-axis would be log N, that 
is, the logarithm of the number of individuals. But 
we do not have abundance data. Although coverage 
score is indeed a rough measure of abundance, we 
have no information about how it scales with abun¬ 
dance nor even whether the quantitative meaning 
of a unit increase in score from 1 to 2 (say) is equiv¬ 
alent to one of increase from 3 to 4. 

In data subsets where all plots had the same area, 
the number of plots, NP, was a perfect measure 
of A and we used it. But we also used NP as a 
surrogate for A in the entire data set, even though 
plots have six different A values. We did the same 
when analysing the plots of a single ecoregion. This 
is justified because we can and did accumulate plots 
in random order over 1000 replications per analy¬ 
sis. Thus we obtained SPAR for the average area 
of a plot. The reader should realize, however, that 
the area added by each additional unit of NP var¬ 
ied between analyses. It depended on the average 


area of the plots in a single plot of that particular 
analysis. Knowing that allowed us to replace NP 
with A when needed. 

To obtain each accumulation curve, we added 
plots one by one and counted the number of unique 
species at each step (with 1000 replications and 
using the WS2M software discussed in Box 20.1). 
Then, using linear regression, we analysed the 
mean values of log S vs log NP at each step to get 
our SPAR for that accumulation curve. 

20.3 Density estimates 

20.3.1 First density estimate 

For our first analysis, we lumped together all 652 
plots in the complete dataset regardless of their 
size or ecoregion. The average plot had 43.8 plant 
species. The result of the linear regression was 
log S = 2.06 + 0.34 log NP. So the plant species den¬ 
sity is 114.8 (i.e. 10 206 ). Its units are not spp/area, 
but spp/plot 0 34 . The R 2 of this SPAR is 0.96 and 
both its coefficients have probabilities in the region 
of 10~ 15 . 

However, Fig. 20.2 shows that we must refine 
our analysis because of systematic deviation from 
linearity. Predicted S values are much too large over 
small NP and too small over larger NP. Thus, the 
very high R 2 is misleading; it comes from the fact 
that NP is on a logarithmic scale so that very few 
points occur on the left side, where the fit is rather 
bad. 

In addition, we still have to deal with the lack of 
abundance data in plots of four different areas. We 
now turn to this. 

20.3.2 Density estimates for subsets with a 
uniform plot size 

Our x variable is NP, 'number of plots in the sam¬ 
ple'. Absence of a species in a plot of 1000 m 2 may 
not mean the same as its absence in a plot of 200 m 2 . 
The obvious and appropriate solution is to restrict 
each analysis to a set of plots of uniform area. 

The set of 588 plots of 400 m 2 is the largest and 
that of 50 plots of 1000 m 2 is the second largest. 
Let us analyse them separately and see how their 
results compare to those of the entire set of 652 
plots. 
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Figure 20.2 Naive estimate of plant species density in 
Virginia. All 652 sampling plots are included, although 
they have various areas and come from seven different 
ecoregions. Each plot is added in random order to an 
accumulation curve, the process is repeated 1000 times, 
and averages obtained. The result is a pronounced 
convex-upward curve with an estimated z value of 0.34 
and a plant species density of 114.8 per A 034 . 
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Density estimates for 400 -m 2 plots 
The average plot of 400 m 2 had 42.25 species. The 
set of 558 400-m 2 plots had 931 of the 1001 species in 
the entire data set. The linear regression was log S = 
2.04 + 0.34 log NP, so the plant species density is 
109.6 (i.e. 10 204 ), almost the same as 114.8, that is, 
that of the entire set of 652 plots (above). 

The R 2 of this SPAR is 0.96 and both its coeffi¬ 
cients have probabilities in the region of 10 -15 or 
even less. 

Density estimates for 1000-m 2 plots 
There were 50 plots of 1000-m 2 . Together they had 
551 of the 1001 species in the intire data set. The lin¬ 
ear regression was log S = 1.92 + 0.50 log NP, so the 
plant species density is 82.4 (i.e. 10 192 ), lower than 
either set that contains the 400-m 2 plots (above). 

The R 2 of this SPAR is 0.98 and both its coeffi¬ 
cients also have probabilities in the region of 10 -15 
or even less. 

Despite the impressive R 2 values, however. 
Fig. 20.3 shows a systematic deviation of the regres¬ 
sions from linearity just like Fig. 20.2. Predicted S 
values are too large over small NP and too small 
over large NP, but the deviations from the regres¬ 
sion of the 1000-m 2 plots are smaller than those 
of the 400-m 2 plots. That is because the 400-m 2 


plots cover a much greater range of NP values and 
include a large number of points far to the right- 
hand side of the graph where the relationship is not 
so steep. 

We confirm our explanation by reanalysing the 
first 50 points of the 400-m 2 results. We did not re¬ 
do the analysis; we just truncated it at 50 points. 
Now log S = 1.76 + 0.55 log NP(R 2 = 0.98)—a result 
much closer to that with the fifty 1000-m 2 plots than 
with the very data set from which the fifty 400-m 2 
plot points came. Species density is 57.6, that is, less 
than that for the fifty 1000-m 2 plots, a result that 
seems reasonable as we should expect fewer species 
in a smaller plot. 

The lesson of the 400- and 1000-m 2 plots 
The z value of the 1000-m 2 plots was 0.50 rather 
than the 0.34 of the 400-m 2 plots, but the first step 
in calculating a species density is to raise A (or its 
surrogate) to the power z. So if two sets of sam¬ 
ples come from different segments of a single curve, 
they will generate different z values (such as 0.50 
and 0.34 above) and thus different species densities. 
That is not acceptable. Hence, for species densities 
to be meaningful, all must be founded on the same 
z value. 

Thus, examination of the two plot areas sepa¬ 
rately has uncovered two problems: 
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Figure 20.3 An analysis for the set of 1000-m 2 
plots and another for the set of 400-m 2 plots. The 
400-m 2 plots yielded results similar to those Fig. 20.2, 
but the 1000-m 2 plots showed a steeper z and a lower 
species density. The accumulation curves of both analyses 
retain a convex-upward shape, which leads to variable 
estimates ofz. 


• the definition of species density implicitly 
requires that comparisons be made between 
entities that share a z value 

• curvature in the data points from which SPAR is 
obtained will lead to different estimates of species 
density if samples encompass different ranges of 
the curvature. 

Yet both these issues have curvature at their heart. 
Curvature is a serious problem. If logarithmic data 
follow a curve, a z value estimated from one part 
of the data set will differ from a z value estimated 
from another part. What causes curvature and how 
might one remove it? That is the topic of the follow¬ 
ing section. 

20.4 Curvature in SPARs 

SPARs arise from four sets of processes (Rosen- 
zweig 2003). Two of them are at work in the Virginia 
plant plots: 

• As area grows, so does the number of habitats 
included in the area. 

• Results from larger areas almost.always contain 
more individuals in the cumulative sample. 

Thus, when we measure the diversity of a place 
such as a 400-m 2 plant plot, two signals influence 
the result: the actual diversity and the sample size 
that we obtained to estimate that diversity. The 


smaller the sample size, the smaller the proportion 
of species we discover. Unfortunately, that creates a 
nagging statistical issue, sample-size bias. We wish 
to estimate the actual diversity and to suppress the 
negative bias of small sample sizes. This problem 
affects all efforts to measure diversity and diversity 
density. 

An accumulation curve that comes from study¬ 
ing one area by adding to its dataset over time 
reflects only its growing sample size. A SPAR based 
on such data is linear in logarithmic space (Rosen- 
zweig 1995). Similarly, a SPAR from a set of areas 
in which one has species lists that are virtually 
100% complete is also linear in logarithmic space. 
However, when the two processes combine to deter¬ 
mine a data set, the result will not be linear in 
logarithmic space. Its diversity, log S, will grow 
very fast over small sample sizes because log S 
reflects both signals. As sample size grows, bias 
declines, leaving only the signal of actual growth 
in diversity with area. We need to remove the sam¬ 
ple size signal as much as possible so as to lay 
bare the true SPAR and thus the underlying species 
density C. 

20.5 Reducing the bias 

Fisher et al. (1943) were the first to address the 
problem of sample-size bias. Their solution yields 
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an index (Fisher's a) of the number of species inde¬ 
pendent of sample size. But Fisher's a does not 
estimate the number of species itself. It remains a 
powerful investigative tool, especially to discover 
trends in local data, but it offers no standard to use 
for interpretation. 

Since Fisher, a number of other biologists and 
biometricians have addressed the sample-size prob¬ 
lem. (In fact, it has assumed a life that is inde¬ 
pendent of its biological origins, for example Efron 
& Thisted 1976.) One approach, rarefaction, has 
treated the problem by imposing a similar bias 
on all results (Chapter 4). Generally speaking, rar¬ 
efaction reduces the data available for comparisons 
by removing data from larger samples until all 
samples suffer equal bias. Thus, rarefaction does 
not so much reduce sample-size bias as surren¬ 
der to it. In addition, its comparisons come from 
the data-poor portion of a sample, the portion 
whose attributes mostly reflect the bias-freighted 
signal and have little to do with the diversity 
signal. 

Methods that do strive for bias reduction 
approach the problem in two very different ways. 
Some investigators work with estimators based on 
extrapolating S to an asymptote while others base 
their estimates on the frequency of scarce species. 

20.5.1 Extrapolation 

As sample size grows, the number of observed 
species rises monotonically towards an asymptote. 
This phenomenon is the accumulation curve itself. 
It is convex upward. The asymptote (or plateau) is 
the true number of species. 

Extrapolation attempts to estimate the diver¬ 
sity at the plateau by fitting an appropriate equa¬ 
tion template to the known part of the accumula¬ 
tion curve. Here are the minimal requirements for 
an appropriate template Rosenzweig et al. (2003), 
regardless of the values of its coefficients: 

• It traverses the point (1,1) because a single indi¬ 
vidual can belong only to a single species. 

• It has a positive first derivative for all sample 
sizes. 

• It has a negative second derivative for all sample 
sizes. 

• It has an upper asymptote. 


Michaelis-Menten extrapolation 
The first to suggest and use an extrapolation tem¬ 
plate was Holdridge et al. (1971). He noticed 
the similarity of the Michaelis-Menten formula 
to diversity accumulation curves. Holdridge fitted 
some data to this template and noted that they fit 
very well. 

The Michaelis-Menten formula has several other 
names in ecology but their parameters can always 
be manipulated to produce a Michaelis-Menten 
equation of the form: 

y = kx/(h + x) 

For purposes of diversity estimation, Michaelis- 
Menten becomes: 

S 0 = SN/Qi + N) 

where S is the true number of species. So is the 
observed number of species, N is sample size ( NP in 
our case) and h is a coefficient of curvature termed 
the half-saturation constant. 

Michaelis-Menten satisfies all but one of the cri¬ 
teria for an acceptable extrapolation formula. It 
does not traverse the point (1,1). Nevertheless, we 
include Michaelis-Menten in our set of methods. 
When data matrices are converted from abundance 
to presence-absence (as we have done for the Vir¬ 
ginia plant data), Michaelis-Menten often seems to 
work quite powerfully (Rosenzweig et al. 2003). 

An alternative set of extrapolators 
A set of extrapolation formulae that do satisfy all 
criteria belong to the following family: 

So = s a (1 - N^ (N) ) (20.1) 

where f(N) is any positive, unbounded, monoton¬ 
ically increasing function of N (which again is NP 
in our case). As N rises towards infinity, equa¬ 
tion 20.1 converges on S. In other words, the asymp¬ 
tote of equation 20.1 is S, the true diversity of the 
system. 

We used two functions f(N) in this study. We 
substituted them into equation 20.1 to produce two 
extrapolation estimators, F3 and F5: 

F3 uses /(N) = q In N 
F5 uses /(N) = qW. 

Like Michaelis-Menton, F3 and F5 have, in no 
sense, been derived from first principles. They 
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simply offer a template with the appropriate prop¬ 
erties. Unlike MM they even traverse the point (1,1). 

20.5,2 Estimators based on the frequency 
of scarce species 

The estimates of this class of methods come from 
looking at the accumulated total diversity and 
the number of locations in which each species 
occurs. 

Burnham and Overton (1979) introduced estima¬ 
tors of this type based on jack-knifing the data. 
They developed five different orders of the jack¬ 
knife, but they also recommended not selecting just 
a single order. Instead, they advocate calculating all 
five orders and choosing among them with certain 
statistical criteria. We followed their recommenda¬ 
tion in our work. 

Chao and her colleagues introduced another 
series of estimators (Chao 1984; Chao & Lee 1992; 
Chao et al. 1996). The first two have now clearly 
been superseded by the more recent ones. In our 
work, we used the incidence coverage estimator 
(ICE), but that is not to say we believe it to be the 
best one. It simply fitted the needs of our data in this 
case, allowed us to demonstrate the use of the Chao 
class of estimators, and was the most convenient 
to use. 

All estimators in this group have the common 
goal of estimating the number of species in a com¬ 
plete sample of a defined sampling universe. In con¬ 
trast, the extrapolators can also include unsampled 
habitats (if sampled ones are carefully arranged) 
(Rosenzweig et al. 2003). 

We employed these bias-reducing tools and they 
improved the estimate of C, as the reader will 
see. We are not, however, trying to write the last 
word on the matter. No doubt there will be further 
advances. However, we did want to explore how 
one might go about the task of measuring species 
density in practice. We trust that future and better 
bias reducers will fit well into the framework that 
we are about to describe. 

20.6 Applying bias reduction 

Using WS2M (Turner et al. 2000), we obtained 
bias-reduced SPARs from five overlapping data 


Table 20.2 Using bias-reduction estimators to generate z values, 
species densities (C values), and R 2 values for Virginia plants. 'Raw' 
signifies that no bias-reduction was used to generate that row of results. 
'CAP' are the 444 plots from the Central Appalachian Forest. See text for 
meaning of other treatment abbreviations. Bias reduction reduces SPAR- 
curvatures andz-values, and increases estimated species densities. 


z values 

All 

400s 

1000s 

CAF 

Piedmont 

Raw 

0.342 

0.344 

0.503 

0.350 

0.379 

ICE 

0.248 

0.243 

0.375 

0.229 

0.201 

B&O 

0.303 

0.283 

0.503 

0.299 

0.297 

F3 

0.265 

0.260 

0.348 

0.258 

0.299 

F5 

0.218 

0.210 

0.248 

0.204 

0.257 

MM 

0.215 

0.207 

0.217 

0.198 

0.227 

log C 

All 

400s 

1000s 

CAF 

Piedmont 

Raw 

2.06 

2.04 

1.92 

1.98 

1.93 

ICE 

2.44 

2.43 

2.34 

2.41 

2.41 

B&0 

2.34 

2.36 

2.16 

2.28 

2.24 

F3 

2.35 

2.34 

2.30 

2.29 

2.19 

F5 

2.48 

2.48 

2.49 

2.43 

2.28 

MM 

2.43 

2,42 

2.47 

2.39 

2.29 

R 2 

All 

400s 

1000s 

CAF 

Piedmont 

Raw 

0.962 

0.961 

0.983 

0.959 

0.974 

ICE 

0.912 

0.894 

0.717 

0.987 

0.937 

B&O 

0.927 

0.900 

0.872 

0.903 

0.817 

F3 

0.986 

0.985 

0.995 

0.984 

0.988 

F5 

0.998 

0.998 

0.999 

0.998 

0.991 

MM 

1.000 

0.999 

0.994 

0.998 

0.998 


sets (Table 20.2). Three are the same as previously 
analysed above. We took the fourth and fifth from 
the two ecoregions with more plots than the rest: 
the Central Appalachian Forest and Piedmont. The 
ecoregional data are already contained in the largest 
data set and many are also in the 400-m 2 and 
1000-m 2 plots. Grouping them specially by ecore- 
gion should avoid any heterogeneity in z values 
that might accompany plant association patterns in 
different ecoregions. 

Table 20.2 reveals some reassuring conclusions. 
First, the bias reducers eliminated most of the 
deviations from linearity. In fact, the R 2 values 
generated by both F5 and Michaelis-Menten are 
remarkable. All but two are 0.998 or more. The 
smallest is 0.991. Figs 20.4a and c show the SPARs 
for two of these 10 cases and Figs 20.4b and d 
show two others that did not remove as much 
curvilinearity. 
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(a) (b) 



log NP log NP 

Figure 20.4 The sample-size bias of raw SPARs is reduced, which in the best cases removes most of its curvature, (a) and (b) All 652 plots are included, 
(c) and (d) Only the 50 plots of area 1000 m 2 are included, (a) Correction by Michaelis-Menten template, (b) Correction by Chao incidence coverage 
estimator, (c) Correction by F5 template, (d) Correction by Burnham and Overton jack-knife estimator. In each graph we also plot the raw accumulation 
curve for comparison. 


Also reassuring was the uniform reduction of z 
values compared to the five uncorrected SPARs. 
Only the estimate from Burnham and Overton's 
jack-knife (0.503) is as large as that of the raw SPAR, 
and then only for the 1000-m 2 plots. Furthermore 
the z values produced by F5 and Michaelis-Menten 
(0.198-0.257) are quite similar to each other and also 
to what ecologists have come to expect of z values 
in nested samples within mainland regions (Rosen- 
zweig 1995). Quite possibly, matrices of presence- 
absence will be useful for generating z estimates in 
ecoregions. 


The log C values generated by F5 and Michaelis- 
Menten are also rather similar. We would perhaps 
be justified in concluding (for example) that the 
species density of plants in Central Appalachian 
Forest is about 257 species per (Area of plot) 0 2 . 
(Note: 257 = 10 241 .) 

20.7 Checking our results on the scale 
of all of Virginia 

We begin by determining the area of one unit 
of NP. The entire Commonwealth of Virginia has 
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110785 km 2 . The average plot in the dataset had 
442.1m 2 . So there are 110785000/442.1 = 262470 
plot units in the entire state. 

Extrapolating the Michaelis-Menten estimates 
for all sample plots to the area of the entire state we 
obtain: 

log S = 2.43 + 0.215 log 262470 = 3.595 
so S = 3936 plant species 
Extrapolating the F5 estimates, we obtain: 

log S = 2.48 + 0.218 log 262470 = 3.66 
so S = 4585 plant species 

Virginia has 4205 known plant species. Neither 
the F5 estimate (4585 species) nor the Michaelis- 
Menten estimate (3936 species) is very far from that. 
We note without comment that if we use their aver¬ 
age coefficients (2.455 and 0.2165), the result is 4248 
species, only 1% different from 4205. 

In short, using the extrapolation estimators the 
method succeeded even in the face of three serious 
obstacles: 

• no abundance data 

• disparate plot sizes 

• inadequate samples of five of Virginia's seven 
ecoregions. 

20.8 Why species density? 

20.8.1 Species density as an environmental 
indicator 

Policymakers need a reliable, efficient way to take 
the pulse of the environment. For this purpose, they 
need a very small set of environmental indicators. 
These indicators will not be exactly parallel to eco¬ 
nomic indicators, but they will be similar in num¬ 
ber (about 10), similar in being general, and similar 
in being able to reflect a nation's overall environ¬ 
mental condition without necessarily enumerating 
all its details. Species density promises to provide 
practical indicators of environmental condition that 
policymakers will be able to use. 

A committee of the NRC began work on this 
project in the 1990s and soon published its list of 
indicators, its reasons for choosing them, and its 
hopes to have them adopted (National Research 


Council 2000). But the NRC's hopes were not ful¬ 
filled. 

After the NRC began its work, the Heinz Cen¬ 
ter undertook a parallel project. Unlike the NRC 
committee, Heinz implemented its own recom¬ 
mendations and began publishing widely antici¬ 
pated reports on the state of the nation's environ¬ 
ment (The H. John Heinz III Center for Science 
2008). Perhaps as a cost of getting the necessary 
cooperation of environmental professionals all over 
the country, the Heinz list of indicators is roughly 
10 times as long as that of the NRC, so its reports 
are thick, data-rich, complex books rather than the 
one-page summary-style reports envisioned by the 
NRC committee. Although there is much overlap 
between the two lists, they differ substantially in the 
ways that they handle and report matters associated 
with biological diversity. 

NRC's indicators include two that are based on 
species density and use the same definition of it 
that we use here, then they compare densities to a 
standard. The standard is the density revealed by 
the taxon-specific SPAR of the natural landscape in 
the ecoregion. 

NRC's strategy is to estimate C for a particular 
taxon in an unexploited (read 'natural') habitat of 
an ecoregion, then to do the same for one type 
of land use. Let p signify pristine habitat and i 
any land use. Now consider the NRC indicator for 
native species density. C in the unexploited habi¬ 
tat, C p , will be the expected value of native species 
density. C in the utilized habitat, C,-, will be the 
observed native species density. The NRC indicator 
is simply C,/C f . 

Thus, the diversity-density indicators of the NRC 
explore new ground. This creates problems for their 
implementation. It will not be immediately clear 
to professionals how to measure their components, 
how to combine them, or how to interpret them. In 
addition, the NRC report was published with a seri¬ 
ous typographical error in the main formula for the 
most important indicator, an error which makes this 
indicator not just irrelevant, but impossible even to 
calculate. 

Although the error was corrected in a subsequent 
publication (Rosenzweig 2004), that publication 
also fails to address the problems of implemen¬ 
tation. This chapter has tried to explain how to 



ESTIMATING SPECIES DENSITY 287 


calculate diversity density—what data need to be 
collected, what problems those data will present, 
and how the data may be processed. 

One might fairly ask whether an indicator that 
requires so much initial effort has much value. We 
believe it does for several reasons. 

• There is nothing inherently difficult about the 
indicator. Instead, it relies on one of ecology's 
most familiar rules: larger areas have more 
species. Thus the initial familiarization period 
will probably be followed by a sense of comfort 
on the part of professionals. 

• Software is currently available to support imple¬ 
mentation. 

• The indicator closely parallels those in the River 
Invertebrate Prediction and Classification Sys¬ 
tem (RIVPACS) family (Moss et al. 1987; Wright 
2000) now in use in the UK, Australia, Canada, 
New Zealand, Sweden and the USA (Hawkins 
et al. 2000), but RIVPACS is much more labour 
intensive and so far has been demonstrated and 
used only in freshwater habitats. In contrast, the 
NRC indicator builds on patterns found in every 
habitat. 

The RIVPACS diversity indicator parallels the 
NRC indicator because RIVPACS is the ratio of the 
number of species found in a place to the number of 
species expected to be found there. In other words, 
the indicator is the ratio <observed/expected>, 
O/E. The NRC indicator uses this idea 
exactly. 

However, the RIVPACS determines £ with 
detailed examinations of the habitat distributions 
of each species. This is hard work, requiring expert 
attention, the measurement of many habitat vari¬ 
ables, and the separate analysis of each species' 
habitat distribution with respect to those habitat 
variables. 

In contrast, the NRC indicator relies on a well- 
established shortcut, the species-area relationship. 

20.8.2 Species density as a topic of study 

People—including ecologists—do like to make 
comparisons. Being able to state, with some con¬ 
fidence, that the butterflies (say) of ecoregion 
A have a higher species density than those of 


ecoregion B will surely please some of us. Being 
able to conclude that a planned change in the style 
of human use in ecoregion X has improved its 
species density of birds (say) will please even more 
of us. 

Having a working method for estimating C, we 
will be encouraged to search for C patterns among 
taxa, among biogeographical provinces, and among 
ecoregions. We know already that, unlike z, C varies 
a lot. We are already certain that (say) beetles of 
the Neotropics have a denser diversity than plants 
of New Zealand, but with our new quantitative 
tool, we should be able to make much more refined 
and interesting distinctions. How does aridity or 
productivity—or any other environmental property 
of an ecoregion—influence its C value? What char¬ 
acteristics might predispose a taxon to unusual C 
values? Do C values behave similarly in different 
biogeographical provinces, and so on? If we do find 
patterns, will we not be inspired to uncover their 
explanations? C opens a new book of macroecolog- 
ical investigations. 

20.9 Key points 

• To account for the curvature of SPAR (species- 
area curves), species density C is defined as C = 
S/A 2 , where S is the number of species, A is the 
area sampled, and z is the curvature of SPAR. 

• All values of S on a single SPAR have the same C 
value. 

• SPARs from raw counts of species reflect two sig¬ 
nals: negative sample-size bias and true increase 
in S with A. 

• Bias reducers straighten SPARs if those bias 
reducers rely on extrapolation of S to an asymp¬ 
tote. 

• The straightened SPARs yield a reasonable esti¬ 
mate of the total S in an area that can be consider¬ 
ably larger than the area in which the SPAR data 
were obtained. 

• C is a useful indicator of environmental condition 
and an interesting property for basic research. 
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Conclusions 

Brian J. McGill and Anne E. Magurran 


'A society is defined not only by what it creates, 
but by what it refuses to destroy.' 

—John Sawhill, former president/CEO of The 
Nature Conservancy 

As already reviewed in the Foreword and Chapter 
1, measuring biodiversity could not be more urgent 
or important. Diversity is being lost at a rate 
orders of magnitude higher than baseline 'nat¬ 
ural' extinctions (Millenium Ecosystem Assess¬ 
ment 2005). Society is concerned enough about 
this to band together and commit to preventing 
the loss of biodiversity with concrete goals by 
2010 (the United Nations Convention on Biodi¬ 
versity (CBD); available at http://www.cbd.int/ 
convention/convention.shtml). The CBD specifies 
what to measure (biodiversity) but does not spec¬ 
ify exactly how to measure it. This presumably 
is the job of ecologists such as those reading this 
book. 

Something not really recognized in the CBD 
but evidenced by this book is that the measure¬ 
ment of biodiversity is an active area of research— 
it is not a static technique that is well under¬ 
stood. In her 2004 book Measuring Biodiversity, 
one of us (Anne Magurran) surveyed the field 
of biodiversity measurement in five core chap¬ 
ters (excluding introduction and conclusions) but 
predicted that measurements of |3 diversity, func¬ 
tional diversity, phylogenetic diversity and genetic 
diversity would see rapid growth. This 2010 book 
on measuring biodiversity, albeit slightly differ¬ 
ent in intent and audience, confirms these pre¬ 
dictions. In comparison, the present book has 14 
core chapters and another five chapters of specific 
applications. The additional chapters include the 
prophesied topics as well as several unforeseen 
topics. 


In this book, and in the literature in general, we 
identify the following as some important trends in 
recent biodiversity research: 

1. Synthetic frameworks: One way to resolve the 
proliferation of different measures of biodiversity 
is to find the underlying mathematical unity that 
ties together many different approaches. The use 
of Hill numbers (Hill 1973) to join together many 
different measures of evenness and diversity 
partitioning as described and extended by Jost 
et al (Chapter 6) is one example. Information 
and variance views of evenness given by Maurer 
and McGill (Chapter 5) is another. 

2. Special cases: It should not be overlooked that 
the enormous diversity of the tree of life itself 
presents challenges. Different groups of organ¬ 
isms are more or less amenable to the standard 
paradigm of field-based censuses performed by 
the human eye on a community that is well 
defined in space and time. Different taxonomic 
groups and counting conditions require modi¬ 
fied methods (Chapters 16 and 19). 

3. Statistical approach: The recognition (Smith & 
Wilson 1996; Etienne 2005; Green & Plotkin 
2007) that even the data that come out of tra¬ 
ditional biotic censuses on traditional groups of 
organisms (e.g. birds, trees, fish) are not 'the 
reality' but an approximation of reality that 
needs appropriate statistical treatment is grow¬ 
ing (Chapters 2,3,5, 9 and 10). 

4. Spatial/temporal context: Biodiversity changes 
over time and space. This is part of what 
makes biodiversity fascinating. Quantifying this 
and placing biodiversity into its spatiotemporal 
framework is increasingly important. This paral¬ 
lels the growing importance of space in ecology 
broadly, where we are beginning to deal with 
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the fact that systems are not closed (Magurran & 
Henderson 2003; McGill 2003a; Zillio & Condit 
2007) and that it is a mistake to study points 
in isolation. Accordingly, biodiversity studies 
increasingly are exploring the relationships in 
biodiversity between points in space or time (e.g. 
Chapters 7,11,12 and 20). 

5. Comparative approach: We would argue that 
many biodiversity measures are most meaning¬ 
ful not in an absolute sense (what does a Simp¬ 
son evenness of 0.75 mean) but rather become 
meaningful mostly in a comparative sense (e.g. 
disturbed sites have lower evenness) (McGill 
et al. 2007; Chapter 9). Kempton and Taylor 
(1974) gave us a simple statistical tool to identify 
methods with maximum discriminatory power 
and Gray (1979) showed the efficacy of this 
approach for applied problems decades ago, but 
uptake has been slow. We are starting to see 
movement towards using biodiversity metrics in 
a comparative fashion that makes them increas¬ 
ingly useful in an applied context (Chapters 9, 
10,17, and 18). 

6. Non-species-centric measures: The study of 
the distribution and abundance of species has 
been a cornerstone of ecology (Krebs 1972; 
Andrewartha & Birch 1984), but as the limits of 
such an approach have become apparent (Law- 
ton 1999) ecologists are increasingly looking else¬ 
where for meaningful assessments of diversity 
that are not species centric (Webb et al. 2002; 
McGill et al. 2006b). This distinction is implicit 
in the organization of this book, with several 
non-species-specific measures emerging (Chap¬ 
ters 13,14 and 15). 

7. Continued emphasis and education on the 
most basic measures: Because of the societal 
importance of biodiversity, ecologists should not 
lose the simple, intuitive ideas of biodiversity 
just because more complex approaches exist. 
Specifically, we must continue to emphasize 
and educate on the proper use of such basic 
aspects of biodiversity as species richness (Chap¬ 
ter 4) and the existence of many rare species 
(Chapter 8). 

8. Process based: One trend that we haven't 
seen but would like to see is a greater tie 


between processes that control biodiversity and 
the metrics that measure biodiversity. Nearly 
all the metrics described in this book are basi¬ 
cally descriptive in nature, and if they tie to 
any process at all it is a grossly oversimplified 
null process such as Poisson spatial randomness 
(Chapters 11 and 12). Aspects of biodiversity as 
distinct as j8 diversity over space, species abun¬ 
dance distributions, and species-area relation¬ 
ships are being unified under quantitative pre¬ 
dictive theories (Hubbell 2001; Harte 2008; Mor¬ 
ion et al. 2008; McGill 2010). This suggests that 
there may be key parameters in these models 
that could at once subsume a panoply of met¬ 
rics and provide metrics that are in some sense 
more fundamental. We hope that these types of 
measures will be identified. 

Comparing to the last book on biodiversity mea¬ 
surement, reviewing the eight trends listed above, 
and noticing the well over 100 metrics of biodiver¬ 
sity described in this book all make it inescapable 
that the toolkit for measuring biodiversity has 
become bigger, more sophisticated, and more com¬ 
plex over time. And it will continue to get even 
bigger and more sophisticated and complex. Is big¬ 
ger better? The honest answer is, 'it depends.' It 
depends on at least three factors, which are covered 
in the next three paragraphs. 

On one hand, it is natural and good that as 
the field of biodiversity studies matures, it will 
become more refined and nuanced in its ability to 
measure biodiversity. The Convention on Biolog¬ 
ical Diversity (CBD) defines biodiversity broadly: 

' "biological diversity" means the variability among 
living organisms from all sources including, inter 
alia, terrestrial, marine and other aquatic ecosys¬ 
tems and the ecological complexes of which they 
are part; this includes diversity within species, 
between species and of ecosystems.' It would be 
odd if scientists had a more simplistic approach 
to biodiversity than politicians and were unable to 
measure all the aspects and nuances described in 
the CBD. Moreover, a nuanced view of biodiver¬ 
sity allows the choice of measures that are appro¬ 
priate for given questions. For example, the effects 
of biodiversity on ecosystem function are proba- 
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bly more related to measures of functional diver¬ 
sity than species richness (Tilman et al. 1997b), but 
the resilience to successfully respond to long-term 
change probably would depend most on genetic 
diversity and phylogenetic diversity. 

On the other hand, bigger and more complex 
is not good if it is a consequence of laziness. 
Specifically, if there is no pruning, no rejection 
and removal of failed attempts, then it is certain 
that much of the current large complex edifice of 
measuring biodiversity should not be there. For 
example, in the field of species abundance dis¬ 
tributions, MacArthur proposed the broken-stick 
model in 1957 (MacArthur 1957) and already by 
1960 (MacArthur 1960) had clearly stated in print 
that he considered the model a failed attempt that 
should not receive further attention. However, it 
continues to be studied (Wilson 1993) and taught 
in basic textbooks (Morin 1999; Begon et al. 2006). 
Similarly, a number of authors have attempted to 
compare the relative performance of different mea¬ 
sures, especially evenness measures (Smith & Wil¬ 
son 1996), but the rejected measures continue to 
be popular. It seems unavoidable to conclude that 
ecologists have not done a good job of simplify¬ 
ing as we go. Many of the chapters in this book 
have performed rigorous studies comparing differ¬ 
ent approaches and measures leading to specific 
recommendations about what should be used and 
not used going forward. 

The above two 'it depends' qualifications are the 
traditional give and take of science—continuously 
expanding the frontiers while hopefully showing 
progress through identifying ideas that work and 
don't work. However, for better and worse, ecol¬ 
ogists' efforts to measure biodiversity do not play 
out solely on the scientific stage. Societal inter¬ 
est in and policy debates over how to measure 
biodiversity and what aspects of biodiversity are 
important can be found everywhere. Given this, we 
must avoid the common mistake made by scien¬ 
tists of revelling in our ability to achieve complexity 
and subtlety. We must be unmistakably clear that 
despite internal debate and ever-increasing com¬ 
plexity, there is a clear and meaningful concept of 
biodiversity. Right now, today, we can make defin¬ 
itive statements about the fact that biodiversity 


is decreasing at historically unprecedented rates 
(Millennium Ecosystem Assessment 2005), that 
some places have more biodiversity than others 
(Pianka 1989; Reid 1998), that certain policies and 
management practices maintain biodiversity better 
than others (Bawa & Seidler 1998; Tscharntke et al. 
2005), and that biodiversity has benefits for society 
(Daily et al. 1997; Loreau et al. 2002). The overview 
of this book (Chapter 1) invoked the analogy of 
blind people trying to understand an elephant and 
returning a complex, multifaceted answer, which is 
accurate, but at the same time it seems unlikely that 
the people in the room were unaware that there 
was a big impressive thing also in the room! This 
ability to be multifaceted while not losing sight of 
the basic storyline must be how we approach the 
measurement of biodiversity going forward when 
all eyes of society are on us. 

In conclusion, in addition to the eight specific 
research agendas identified above, future work on 
biodiversity needs to be guided by broader prin¬ 
ciples: (i) continue to refine our ability to measure 
different facets of biodiversity even while (ii) prun¬ 
ing the failed ideas and (iii) keeping firm the real¬ 
ity and measurability of biodiversity to the general 
public. 

This book set out to provide an accessible 
description of the frontiers of measuring 
biodiversity. We hope the reader has found the 
journey as exciting, inspiring, and important as 
we do. We end, as we began, by quoting Charles 
Darwin's On the Origin of Species (1859); specifically 
we quote his last sentence which is a meditation 
on whether scientific understanding of biodiversity 
increases or decreases the ability of the planet's 
diverse creatures to inspire awe in us. Although 
Darwin speaks of the improved understanding of 
evolution, we would suggest his conclusion applies 
to the improved measurement of biodiversity 
as well. 

'There is grandeur in this view of life, with its sev¬ 
eral powers, having been originally breathed into a 
few forms or into one; and that, whilst this planet 
has gone cycling on according to the fixed law of 
gravity, from so simple a beginning endless forms 
most beautiful and most wonderful have been, and 
are being, evolved.' 
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Abiotic environmental factors: 195, 
224,255,256 

abundance: xvi, 1-4,7,11 -16, 18-21, 
23, 26 -31, 33-36,44r-50, 53, 
55-64,85,86, 97,100,105,108, 
149-151,152,154,160,166, 199, 
206,229, 230, 232,274 
ABC (see abundance/biomass curve) 
absolute abundance: 73, 75,102 
abundance weighted: 199, 

206,207 

abundance/biomass curve: 243, 
244 

abundance variation: 261-263 
AW (see abundance weighted) 
cover: 19,20,105,124,137 
cumulative abundance curve: 246 
density: 11,17,19-21. 28-30,32 
evenness: 52,176 
global abundance: 97 
local abundance: 97, 98 
maximum abundance: 187 
minimum abundance: 187 
numerical: 101,103,124,137 
random: 188 ,189 
relative abundance: xvi, 26,27, 
33-36,42,45, 46,48,52,57-64, 

6 7-71, 73-76, 78,87, 93,98, 

99,101, 123,138,180-181, 

232,275 

sample abundance: 98, 99,112 
shared abundance: 76 
abundant species (see also common 
species): 97,99,206,207 
accuracy: 14,24, 52,139,237 
agroforest avian assemblage: 
frugivores: 259 
insectivores: 259 
nectivores: 259 
areal coverage: 20 

AIC (see Akaike's Information Criterion) 


Akaike's Information Criterion: 64, 
65,135-137, 273 
algorithms: 134,200,206,269 
bootstrap algorithm: 129 ,133 
hypergeometric algorithm: 128, 

129 

optimization algorithm: 127 
alpha (a) diversity: 4, 27,28,33,41, 
55,64,77,78,79, 81-84, 86-88, 
90, 93,185,187,197 
ammonoids: 272 
amphibian: 214,217 
ants: 214 

assemblage: 66,67, 69, 70, 72-79, 
82-84 

multiple assemblage: 69-72, 74-76, 
80-82 

random assemblage: 189-190 

BAC (see bacterial artificial 
chromosome) 

Banks, Joseph: xv 
Barcode of Life Datasystems: 

214 

basal: 202, 203 
Bayes factor: 135,136 
Bayes' theorem: 135 
Bayesian Information Criterion: 
135-137 

beetles 102, also see Coleoptera 
beta (£) diversity: 4, 27,33,41, 55, 64, 
66,72,76,78-82,84,87,141,151, 
185,197, 250, 259, 291 
Whittaker's beta diversity: 71 
bias: 14,15,17, 18, 24,25, 28,29,31, 
32, 34-36,39-42,44,51-53,72, 
75,83,114-226,181,187,188, 

223,228 

measurement bias: 14 
numerical bias: 190 
parameter bias: 128-130 


sampling bias: 14-26,17, 39, 48, 53, 
76, 80, 81, 83, 
sample size bias: 279,282 
size: 14,15 
species: 15, 24 
BIC (see Bayesian Information 
Criterion) 

binning: 107, 109, 137,163, 

265, 267 

biodiversity: xvi, 1-3,4-7,25-29, 
34-36,39, 41, 47,48, 50, 51, 53, 
55, 74,82, 85 -88,92, 93,94,98, 
99,141,143,152,154,176,177, 
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259-261,276,291-293 
assessment 88 
behavior: 2, 63, 86 
components: 26 
loss: 1, 4, 6 

measures: 1-6,25, 26, 28-30, 253, 
259,254 

morphology: 2, 208,209,211,212 
regional: 25, 27, 35, 36 
bioindicator: 255-257, also see 
environmental indicator 
biological control: 
natural enemies: 
pollination: 
provision: 

biological hierarchy: 255,261 
biomass: 11-13,18-20,99,103,105, 
124,137,187,188 
measuring: 19, 20 
biotic environmental factor: 224, 

255,256 
bison 99 

body size distribution: 123, 245 
BOLD (see Barcode of Life Datasystems) 
Bonferroni test: 161 
bootstrapping: 187 
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Bray-Curtis index: 74, 75, 84 
Brillouin index: 56, 61, 241 

CA ( see correspondence analysis ) 
Canberra index: 69, 70 
capture mark recapture approach: 

270 

Carmago evenness: 57, 127,121,241 
catchability: 32 
CBD ( Convention on Biological 
Diversity) 

CBOL (Consortium for the Barcoding of 
Life) 

CDF (see cumulative distribution 
function) 

census: 11,22,61, 89,90,94,98,102, 
187, 211,291 
census interval: 90,94 
centroid: 76,180,181 
Chao-Jaccard abundance index: 
69-71, 76 

Chao-Sorensen abundance index: 
69-71, 76 

chronostratigraphic units: 265, 266, 
267, 271 

faunal zone: 266 
stages: 265 
substages: 265 
CHV (see convex hull volume) 

CITES (see Convention on International 
Trade of Endangered Species) 
clams: 11 

climate change: 85,103, 238, 239, 252, 
256, 258 

clonal propagation: 20 
clumping: 112, 154,160,162,166,178 
cluster analysis: 113,120 
cluster sampling: 17,19, 21 
CMR approach (see capture mark 
recapture approach) 
coelacanths: 100 

COI (see cytochrome c oxidase I gene) 
collector curve: 231 
Coleman curve: 46 
Coleoptera: 208, also see beetles 
common species: 98-100,102,106, 
113,259 

commonness: 4,97,102,105 
community: xvi, 1, 3, 4, 5,12-15, 21, 
25-28,31,35,36,39,41,55-64, 

68, 74, 82, 85-87,92, 93,105,125, 
129, 139,176,184,191-193,197, 
198,205,206,208,228, 232,238, 
241,245 

benthic community: 134 
CDI (see community disturbance 
index) 


clustered community: 202 
community assembly: 105,177, 
185,202 -204 

community composition: 66, 85, 

91,188, 222,228,247, 250,251 
community disturbance index: 250 
community diversity: 208,212 
community ecology: 105,175,176, 
195,205, 249 

community size spectra: 245 
ecological communities: 92,195, 
203 

expressed community: 31,87 
local community: 62,132,172,195 
metacommunity: 39,41, 253 
overdispersed community: 204 
phylogenetic community: 195, 205 
phylogenetic community ecology: 
206, 207 

potential community: 64,87 
randomly assembled community: 

201,203,206 
subcommunity: 114 
community fingerprinting 
techniques: 224 
ARISA (see automated RISA) 
automated RISA: 224,227,230 
denaturant gradient gel 
electrophoresis: 224,226 
DGGE (see denaturant gradient gel 
electrophoresis) 

ribosomal intergenic spacer region: 
224, 227 

RISA (see ribosomal intergenic spacer 
region) 

single stand conformation 
polymorphism: 224 
SSCP (see single stand conformation 
polymorphism) 

terminal restriction fragment 
length polymorphism: 224, 227 
T-RFLP (see terminal restriction 
fragment length polymorphism) 
competition: 195 

complete spatial randomness: 160, 
161,162,166,168 
composition: 66 

compositional differentiation: 66, 
67,77 

compositional similarity: 66, 67, 75, 
77, 78 

relative compositional 
differentiation 67 
relative compositional similarity 
67,68 

confidence interval: 21, 40, 45, 47, 52, 
81, 83, 271, 272, 275 


confoundment: 108 
Connolly algorithm: 134 
conservation biology: 105,265,271 
Consortium for the Barcoding of 
Life: 213 

Convention on Biological Diversity: 
292 

Convention on International Trade of 
Endangered Species: 213 
convex hull: 178,180,181 
convex hull volume: 178,189,192 
correction factors: 15 
correlation: 189,191,201 
negative correlation: 202, 266 
partial correlation: 191 
correlogram: 161, 163, 164,168, 166 
correspondence analysis: 248 
covariance: 163, 165, 367, 168, 186 
anisotropic: 165 
isotropic: 165 
stationary: 165 

covariogram: 163, 164,165,168, 171 
Crenarchaeota: 229 
cross-correlogram: 166 
cross-covariogram: 166 
cross-kriging: 362, 167 
cross-semivariogram: 365,166 
CSR (see complete spatial randomness) 
cubic polynomial: 159 
cumulative distribution function: 

108, 109,164 

cytochrome c oxidase I gene: 230, 
212-215 

Darwin, Charles: xvi, 1,2, 3, 7, 293 
data: 

abundance data: 39-41, 50,53,67, 
81, 121,146,148,150,262,280, 
283 

empirical data: 146 
frequency data: 20, 80, 187 
genotypic data: 217 
geospatial data: 354,156,367 
gridded data: 158, 364, 365,167 
incidence data: 39-41,43, 47, 50, 

76, 279 

intraspecific data: 146 
occupancy data: 141,148,150 
occurrence data: 143,151 
presence-absence data: 195,198, 
201,207,240,249,250,262,279, 
283 

presence data: 41, 81 
De Rerum: xv 

degrees of freedom: 159,165,169,272 
dendrogram-based indices: 178,179, 
182 
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detectability: 3,18, 25-36,125,137, 
143,150 

detection threshold: 226, 229,230 
designing a survey: 12,14, 23, 24 
Deviance Information Criterion: 135, 
136,137 

DIC (see Deviance Information 
Criterion) 

differentiation: 66, 67, 68, 76,77-79, 
81, 83,84 

directional change: 85,86,93 
discrete individuals: 43,48 
dispersion: 175,176 
overdispersion: 175 
spatial dispersion: 175 
spatial overdispersion: 175 
trait dispersion: 175-177,183 
trait overdispersion: 175 
distance sampling: 29 -31 
distance: (also see index of (dis) 
similarity) 

chi-square distance: 76 
Euclidan distance: 75, 91 
Hellinger distance: 76 
Lance Williams' Canberra 
distance: 76 

Manhattan distance: 75, 76 
Minkowski distance: 76 
Orloci's chord distance: 76 
phylogenetic distance: 194, 199, 
200 

distance measure: 73,75, 76 
distribution: 62, 200,228,230,231, 
233,234,252 

abundance distribution: 123,125, 
137 

chi-square distribution: 160 
chi-square null distribution: 133 
clumped distribution: 160 
Dirichlet distribution: 271 
empirical cumulative distribution 
58 

fossil distribution: 275 
frequency distribution: 125, 
132-134,137,142,261 
gamma distribution: 58, 112, 116, 
125,126 

geographic distribution: 274 
lognormal distribution: 58, 120, 
124,126,200,231,232,246,247 
multinomial distribution: 59,63, 

65, 269 

normal distribution: 131 
null distribution: 132-134 
parametric distribution: 73 
posterior distribution: 135-137, 

140 


probability distribution: 56-58, 61, 
130,135,136 

random distribution: 143,145,147, 
149 

spatial distribution: 125,141,143, 
147,150 

species-occupancy distribution: 
142,148,149 

species probability distribution: 67, 
79 

species rank abundance 
distribution (SRA): xvi 
true distribution: 125,135 
uniform distribution: 125,127,143, 
149 

disturbance: 5,97,224, 237 

anthropogenic disturbance: 237, 
238 

biological: 238 
chemical: 238 

disturbance frequency: 238 
disturbance regime: 238, 247 
duration: 238 
physical: 238 
pulse: 238 

natural disturbance: 238 ,242 
divergence: 67, 79, 211, 214 
DNA divergence: 214 
maximum divergence: 214 
phenotypic divergence: 212 
diversification: 200 

accelerated diversification: 196, 

200,203,206 

constant diversification: 196 
decelerating diversification: 196, 
200, 203,206 

diversity: 1-5,7,11-15,18-21, 23, 28, 
29,31,34,36, 66, 77, 78,83,89, 

91,152,154, 158, (also see alpha 
diversity, beta diversity, estimators 
of diversity, gamma diversity, 
Hurlbert diversity Margalef 
diversity, Menhinick diversity, 
Shannon diversity, Simpson 
diversity) 

assemblage diversity: 255, 261 
biological diversity (see also 
biodiversity) 208, 217,292 
classical diversity: 87 
diversity numbers: 56, also see Hill 
number 

functional diversity: 92,175,177, 
183,187,192,193,291, 292 
genetic diversity: 5, 208, 212, 217, 
291, 293 

high diversity: 1,152 
microbial diversity: 5 


low diversity: 1 

microbial diversity: 216, 221, 222, 
224-226,228,236 
phylo-j3-diversity: 92 
phylogenetic diversity: see 
phylogenetic diversity 
regional diversity: 82 
spatial diversity 92 
species diversity: 11-14,18, 20, 21, 
23,55,56, 59-63,77 
taxonomic diversity: 92,175,178 
temporal diversity 88, 92 
trait diversity: 175,176,178, 

183,185,188-190,193,240, 

243, 245 

DNA bar-coding: 212, 215 
DNA sequence: 194,208,209 
DNA sequence variation: 209 
chloroplast: 209,212,213 
cpDNA (see chloroplast) 
diploid system: 209 
haploid: 212 
haplotypes: 209 
mitochondrial DNA: 209 
mtDNA (see mitochondrial DNA) 
y-chromosome: 209 
DoD (see US Department of Defense) 
dominance: 56,57, 58, 60,61 
absolute dominance: 57 
dominant species: 243, 249 -251 
McNaughton dominance: 57 
relative dominance: 57 
dominance pre-emption species 
abundance model: 126 
Drosophila melanogaster: 256 
Dutelliel's correction: 169 

eCDF (see empirical cumulative 
distribution function) 

Ectopistes migratorius (see passenger 
pigeon) 

efficiency: 15,16,24,52,113 -116 
ecological threshold: 256 
ecomorphology: 176,185 
ecophysiology: 186 
ecosystem: xvi, 2, 5,15,17, 55, 61, 
237,238 

agro-ecosystems: 259,261 
ecosystem ecology: 176 
ecosystem function: 175-177, 

237,243 

ecosystem processes: 175,176,190, 

193.237 

ecosystem services: xvi, 2, 

176.238 

eigenvalue decomposition: 47 
electrofishing: 14 -16 
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empirical cumulative 

density/distribution function: 
58, 91, 107, 108, 109,122,244 
empirical rank abundance: 133 
empirical results: 206, 244 
empirical samples: 55,61-65 
empirical species-area relationship: 
134 

empirical study: 195,197, 206 
end-Cretaceous: 271 
end-Devonian: 270 
end-Ordovician: 270 
end-Permian: 270 
end-Triassic: 270,275 
entropy: (see Hill number, Renyi 
entropy, Shannon diversity) 
environmental indicator: 278, 286, 
287, also see bioindicator 
estimators of diversity: 41, 50-54 
ACE: 40, 50,51 

Chaol: 40, 50-53, 56, 61, 116,117, 
119 ,120 

Chao2: 40, 41, 50-53, 56, 61 
incidence coverage estimator 
(ICE): 40, 41, 51 

jackknife: 41, 50-52,83,279,285 
Michaelis-Menten estimator: 49,52 
ESU (see evolutionary significant units) 
Etienne algorithm: 133 
Euclidean Distance: 91,180,182,187 
eukaryotes: xv, 214,216 
evenness: 2, 3,5, 56, 57, 58, 60, 61,64 
87, 88,109,110, 114,176,178, 

181,187, 231,239,241,243, 245, 
(also see Carmago evenness, NHC 
evenness, Shannon evenness, Smith 
Wilson evenness) 
evolution: 258, 262 
evolutionarily significant units: 207 
expected mean: 200 
extrapolating: 44, 45, 49, 50, 52, 53, 
283,286 

Michaelis-Menten extrapolation: 
279, 283 

extinction: 265,267,269-273,275 
distribution of extinctions: 266 
local extinction: 86, 89,90,101 
mass extinction: 265,269, 270 
natural extinction: 291 
extinction risk: 206, 266, 274 

FAD (see functional attribute diversity) 
FD (see dendrogram-based indices ) 

FDis (see functional dispersion) 

FDiv (seefunctional divergence) 

FDPG: 183,191,192 
FDQ (see quadratic entropy) 


FDvar (see variance-based functional 
diversity) 

FEve (see functional evenness ) 

Felis silvestris complex: 211 
first (1 st ) order spatial effects: 153, 
157,159, 161,165, 

167,170 

fish 98, 99, 101, 102 ,102 
largemouth bass 15 
smelt 15 

Fisher's a: 58, 283 
Fisher's logseries: 120,246 
fossil record: 265,270, 275 
functional attribute diversity: 183 
functional dispersion: 179 ,180 
functional divergence: 179, 180,188, 

191.193 

functional evenness: 179, 181,183, 

188.191.193 
functional genes: 224 
functional morphology: 186 
functional richness: 243 
functional trait: 175-177,182,185 

Gambin alpha: 58 

gamma (y) diversity: 4,27,28,33,66, 
77-79,83,93,185,197 
gap analyses: 271, 274 
GDP (see gross domestic product) 
gear efficiency study: 15,16 
genetic analysis: 211 
general least squares: 165 ,169 
generalized logistic parameters 59 
genomics: 209, 217 
genome sequence: 
genotype: 39,209, 217 
geographically weighted regression: 
170 

geometric distribution: 108,110 ,112 
geometric mean: 26,33-36 
geometric series model: 103 
geospatial statistics: 155 
Giant panda (Ailuropoda melanoleuca) 
216 

Gini Simpson index: 69,77-79, 

81,84 

GLS (see general least squares) 

Gower distances: 79, 70,76, 

186,187 

gross domestic product: xvi 
Guassian bell curve: 158 
GWR (see geographically weighted 
regression) 

heterogeneity: 18, 21,25,26, 29-31, 
36, 63, 79, 83,143,149,261 
heteroplasmy: 216 
heterozygosity 88 


histogram: 107 ,109 
homogeneity: 18,26,56,66, 67,180, 
238,274 
homoplasy: 212 
homoscedasticity: 138 
Hill number or index or diversity: 61, 
78-79,241,291 

Horn index: 69, 71, 73, 74,82, 84, (also 
see Morisita-Horn) 
Houchmandzadeh correlation 
model: 170 

Hurlbert PIE diversity: 56, 117,120 
ID (see index of distribution) 
immigration: 85,89,101 
index of species distinctness: see 
species distinctness) 
index of (dis) similarity (also see 
Bray-Curtis, Chao-Jaccard, Chao- 
Serenson, Canberra, distance, 
Gower, Jaccard, Morisita-Horn, 
Sorenson) 

additive index: 69, 70 
correlation coefficient index: 69, 70 
general: 68 

normalized expected species 
shared index: 69, 70 
percentage similarity index: 69, 

70, 71 

index of dispersion: 160 ,168 
indicator species: 12 
interpolating: 45,49,52,157,159 ,165 
intervals: 266, 267 
interval length: 266 
invariance 

density invariance: 67 
replication invariance: 67,68 
IUCN Red List: 211 

Jaccard index: 68-70, 72, 76, 82,84, 

90, 249 

littoral zone: 13,14 
k-dominance plot: 245 
kernel estimation: 107 
kernel smoothing: 157, 158 ,157 
kriging: 157, 159,160,163, 165 ,171 
cross-kringing: 162,167 
Kulczynski-Cody index: 68,70 

Latimeria chalumnae (see coelacanths) 
landscape: 253, 254, 259 
agricultural managed landscape: 
259,261,264 

fire management regime: 254 
forestry managed landscape: 259, 
261 

landscape structure: 261 
landscape variation 255 
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management intensity gradient: 
260, 261 

managed landscape: 253, 256, 
258-260,263, 264 
management regime: 254, 257 
modified landscape: 253,258, 

259,264 

least squares: 130,131,137,138,159, 
(also see general least squares, 
ordinary least squares) 
conditions for least squares: 131 
Lennon index: see Sorenson, 
conditional 
lepidosaurs: 265 
lethal alleles: 217 
linear mixed model: 169 
linear regression: 280 
Linnaeus' binary codification: xv 
Living Planet Index: 92 
LMM (see linear mixed model) 
locality (see community) 
log abundance: 57, 58, 106,108, 

113,129 
log skew: 57, 58 
logistic parameters: 58, 59 
lognormal CV: 58 
logseries: 106,108, 112,115, 121 
Loxodonta africana: 211 
Loxodonta cyclotis : 211 

macroecology: 137,142,151, 224,229 
macroevolutionary: 195 
macroorganisms: 223 
Marczewski-Steinhaus distance: 184 
Margalef diversity: 3,56, 116,117, 
119,120 

matrix approach: 262 
maximum likelihood estimator: 83, 
111,112,119, 121 

MCDS (see multiple-covariate distance 
sampling) 

McIntyre dominance: 120 
McNaughton dominance: 114, 
116,117 

MD ( see mean distance) 

MDS (see multidimensional scaling) 
mean distance: 179, 180,182,184,188 
mean phylogenetic distance: 182, 

199, 200 

Menhinick diversity: 56 
metagenome: 228,229, 234 
metrics: 195 

common species metrics (see 
dominance metrics) 
distance based metrics: 198, 201, 
202, 205 

diversity metrics: 56, 61 


dominance metrics: 57, 61 
evenness metrics: 57, 61 
high rarity metrics: 57 
multivariate metrics: 239,251 
node based metrics: 196- 198,201, 
202,205 

pc (see per capita) 
pt (see per taxon) 
per capita: 266,267 
per taxon: 266, 267 
richness metrics: 56 
semi-parametric metrics: 58 
type 1:197, 200, 201 
type II: 197,198, 201 
univariate metrics: see univariate 
metrics 

MFAD (see modified functional 
attribute diversity) 
microbial ecology: 5, 222, 225, 236 
micro-evolutionary: 258 
micro-heterogeneity: 229 
microorganisms: 208, 214, 221, 223 
microsatellite DNA loci: 208 
minimum spanning tree: 177, 179, 
178,182,186,190 
mixture models: 49 
MLE (see maximum likelihood 
estimator) 
mobility: 18 
model analysis: 
absolute goodness of fit: 124 
asymptotic test: 132,133 
chi-square: 133 
deviance: 133,136,137 
Engen-Lande model: 124 
goodness of fit test: 131,132,134, 
135,138-140, 245,246 
Kolmogorov-Simimov test: 133 
model fitting: 123,124,128,132, 

133.137.139.140 

model selection: 124,132,134—136, 

139.140 

phenomenological model: 124 
theoretical models: 124 
Tokeshi simulation: 133 
modified functional attribute 
diversity: 183 

molecular (genetic) market: 209, 210, 
217 

molecular genetics: 209,211 
molecular phylogeny: 223 
molecular scatology: 212 
monotonicity: 67 
Monte Carlo simulation: 267 
moths: 31,67,68 

Morisita-Hom index of similarity: 69, 
71, 73, 74, 82, 84,90,162 


morphological analysis: 211, 221 
morphospecies: 5. 
motility: 13,18 

Motomura model: 106,123, 246 
MPD (see mean phylogenetic diversity) 
MST (see minimum spanning tree) 

MST curve: 183,188 
multidimensional scaling: 249 
multiple-assemblage similarity 
profile: 80, 81 

multiple-covariate distance sampling 
engine of distance: 33 
multivariate analysis: 237, 250 
multivariate dispersion: 76 

NHC evenness: 57 
NMDS (see non-metric MDS) 
Nachman model: 143,146 
National Research Council: 278, 

286, 287 

NJ methods: 215 
non-metric MDS: 249, 250, 251 
normalized difference vegetation 
index (NDVI): 259 

Nuclear mitochondrial pseudogenes: 
215 

numts (see Nuclear mitochondrial 
pseudogenes) 
natural selection: 1 
nearest neighbor: 162,163,166, 

181,188 

Blyth-Ripley ratio: 162 
Clark-Evans ratio: 162 
Hopkins ratio: 162 
nearest neighbor interpolation: 

157 

nearest neighbor metrics: see 
nearest neighbor metrics 
nearest neighbor metrics: 188,193 
density of species packing: 179, 
183,189,191 

evenness of species packing: 

183,189,191 

evenNND (see evenness of species 
packing) 

meanNND (see density of species 
packing) 

regularity of species packing: 

183 

sdNND (see regularity of species 
packing) 

Neofelis diardr. 211 

Neofelis nebulosa: 211 

net primary productivity: 254 

neural circuitry: 2 

neutral model of biodiversity: 

74,84 
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niche: 185,187 

competition-niche model: 177 
mechanistic niche modeling: 262 
niche apportionment: 97,100,124, 
126,137 

niche partitioning: 99,177 
niche preemption: 106 
niche similarity: 123 
realized niche: 185 
nonthermophilic: 229 
normalizing: 75,128,136 
North Atlantic Oscillation: 258 
NRC (see National Research Council) 
nuclear DNA: 209 
null model: 176,189,190,198,199, 
205,206,263 

occupancy: 15, 59, 98, 141,142, 
148-150,160 
absence: 280 

occupancy-abundance relationship: 

143,145-147,150,151 
occupancy-area relationship: 141, 
143,148-150,262 
occurrence: 2,141,143 
Ochiai index: 68 ,70 
OLS (see ordinary least squares) 
operational taxonomic unit: 222 
orders of magnitude: 1,3,232, 232, 
235 

ordinary least squares: 167 

ordination: 113 

organism size: (see species size) 

origination distribution: 265,266 

origination rate: 265, 267 

orthogonality: 114,115 

OTU (see operational taxonomic unit) 

overdispersed: 155 

palaeontological: 86,265 ,267 
Paleobiology Database: 270 
parameter: 55, 56, 58, 59, 62, 63, 

126,127 

estimates: 127,129,130 
migration parameter: 131 
neutral model parameter: 134 
parameter bootstrapping: 133, 

134 

parameter estimation: 126,128, 
130,137,139 
rate parameter: 126 
scale parameter: 126 
smoothing parameter: 158,160 
shape parameter: 126 
Pareto distribution (see also Power 
distribution ): 111, 113 
partitioning 


additive partitioning: 79, 81, 

82, 84 

multiplicative partitioning: 79, 

81,84 

passenger pigeon: xvi, 99 
PBDB database: 270,271,274 
PCA (see principle components 
analyses) 

PCoA (see principle coordinates 
analysis) 

PCR (see polymerase chain reaction) 
PctRare 
1%: 58 
5%: 58 
N/S: 58 

PD (see phylogenetic diversity) 

PDF (see probability density function) 
Pearson's x 2 test statistic: 131-134 
Pearson correlation: 162,188 ,190 
Phanerozoic: 267, 270 
phenotypic: 172 
phenotypic change: 258 
phenotypic plasticity: 185,261 
phenotypic variation: 176 
phototrophic: 229 
phylogenetic: 5,84, 86,133,175 
phylogenetic clusters: 200,206, 207 
phylogenetic diversity: 5, 92,175, 
179,195,197, 199, 207,240,241, 
291 

MNND (see mean nearest neighbor 
distance) 

MPD (see mean phylogenetic 
distance) 

mean nearest neighbor distance: 

183,193, 199, 201, 203,205 
mean phylogenetic distance: 182, 
199, 201, 203,205 
SPD (see sum of phylogenetic 
distance) 

sum of phylogenetic distance: 199, 
201 

phylogenetic overdispersion: 200, 

203.206.207 

phylogenetic tree: 196, 197,199,207, 
222, 223 

balanced tree: 196, 200, 202-204, 

206.207 

imbalanced tree: 196, 200, 201, 203, 
204, 206,207 
phylogeny: 209 
local phylogeny: 197 
regional phylogeny: 197, 200 
subset phylogeny: 197 
physiological properties: 223 
physiological variables: 262,263 
desiccation resistance: 262 


metabolic rate: 262 
thermal tolerance: 262 
pilot surveys: 16,18, 24 
plot: 11,13,15,17-23 
nested: 18, 19, 22 ,23 
number of plots: 280 
modified Whittaker: 18 
Whittaker: 18 ,19 
population: 11,13-16,17-19,21, 

23,24,30-35, 208, 

220,211,216 

human population: 252, 254, 

263 

population density: 254 
population maintenance: 255 
population reduction: 255 
Poisson algorithm: 128 ,129 
Poisson distribution: 58,112, 125, 
133,146,160 ,161 

Poisson lognormal: 53, 222,115, 118, 
125,126,128 ,129 
Poisson variability: 131 
polymerase chain reaction: 223 
posterior distribution: 135-137 
posterior odds ratio: 136 
PowBend distribution: 111, 115, 

119 

Power distribution (see also Pareto 
distribution)-. Ill, 114 
power law model: 143,144,146 
precision: 14, 16-18, 24,52,138 
presence: 41,48 

Preston's data smoothing method: 
107 

Preston's lognormal: 110 
Preston plots: 99,109 
Preston's temporal scales: 86 
principle components analyses: 186, 
292,248 

principle coordinates analysis: 186, 
192,249,250 

probability density function: 107, 
164, 166,168 

probability of detection: 25,28, 30, 
33,36 

prokaryote: 221,224,228 
prop LN: 58 
proteomics: 229 
proteorhodopsin: 229 
Pyrenean ibex (Capra pyrenaica 
pyrenaica): 217 

pyrosequencing: 225, 227, 229, 232 

quadrat data: 156,163 
quadratic entropy: 179 ,182 
quadratic polynomial: 159 
qualitative: 195,197,200 
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quantitative: 105,106,178,186,187, 
195,200 

quartic polynomial: 159 

rxc approach: 255, 262-264 
RAD (see rank abundance distribution) 
RAD beta (see NHC evenness) 
Rabinowitz approach: 103 
Rabinowitz's seven forms of rarity: 
xvi 

random sample: 17, 21,25,27, 28,36, 
42,48-50,52,74,125,134,137 
RDA (see redundancy analysis) 
random placement: 48 
range size: 

range size frequency distribution: 
123 

rank abundance: 99,102,103, 227, 

130,131,134,135,231 
rank abundance curve: 101 
rank abundance distribution: 91, 106, 
108, 127,130,138,225,243 ,244 
rank abundance plot: 107, 243 
rare species: 26, 27, 29,30, 31, 33,35, 
36,40,42,43,48,50,54, 56-58, 
60-62,72,73, 83,97, 99,101,102, 
105,106, 109, 121, 206, 208, 212, 
239,245,250,292 

rarefaction: 39, 42, 45-49, 52, 53, 62, 
187,197, 283 

individual-based: 42,45,46, 47, 

48.53 

sample-based: 42,44,46-49, 

52.53 

rarefaction curve: 42,45, 46,48,49 
individual-based rarefaction 
curve: 42,45-47 

sample-based rarefaction curve: 

42, 44,47,49 ,51 
rarity: xvi, 4,16,19,31,57 
rates for extinction and origination: 
266 

continuous: 269,271,272 
gradual: 271 
pulsed: 266, 269, 271 
redundancy analysis: 248 
regional heterogeneity: 79 
remote sensing: 19,20 
Renkonen index: 75 
Renosterveld: 253 

Renyi entropies of order: 56, 60, (also 
see Hill) 

representativeness: 
of sites: 26,32,35 
of species: 25 

restriction fragment length 

polymorphism analysis: 208 


ribosomal RNA: 210, 214, 221, 

223, 229 

RFLP (see restriction fragment length 
polymorphism analysis) 
rRNA (see ribosomal RNA): 
rxc matrix approach: 253,255, 
262-264,263 

SAD (see species abundance 
distribution) 

SAM (see spatial analysis of 
macroecological data) 
sample 

ecological sample: 55,57,59-65 
empirical sample: 55,61-65 
sample-based species accumulation 
curve: 43 

sample size: 4,16,18, 21,24,26, 72, 
75,83,224,226-229, 

232-234 

sampling: 11-24,265,271,274 
adaptive: 17,19 
design: 12,13,17,19,21 
destructive techniques: 15 
double: 15 ,16 
effort: 66, 72,83 
error: 11,14,17, 21, 24,103 
fraction: 72, 75 
frame: 13,14,17, 21, 24 
intensity: 240 

method: 12-18, 21,23,24,48,253, 
264 

preservation (see also rate): 265, 271 
rate: 265,267,268 
sufficient: 47 
unit: 17,18, 20, 21 
seasonality: 20 

second (2 nd ) order spatial effects: 

153,154,159,160, 161,165, 

166,170 

semivariogram: 158, 264-166 
sequencing: 224, 228, 231 
parallel pyrosequencing: 225 
sequencing error: 225, 227, 232,234 
sessile 18 

Shannon diversity (and 

Shannon-Wiener): xvi, 3,15,19, 
27-28,33-36, 56, 61,63-65, 

87-88, 91,114-117,120-121, 241 
Shannon entropy: 73,77-79, 81, 

83,84 

Shannon evenness: 57,64,114, 116, 
127,121 

SHE analysis: 241 
Signor-Lipps effect: 271 
Similarity measures: 67, 71, 76, (also 
see index of (dis)similarity) 


abundance-based similarity 
measures: 73 

incidence-based measures: 68, 76 
incidence-based similarity 
measures: 72, 73 

multiple-assemblage similarity: 80 
Simpson diversity: 3,15,19,27-28, 
33-37,56,60-61,87-88,91, 
114-117,120-121,183,188,191, 
241,292 

Simpson evenness: 57, 64, 116,117, 
121, 240 

single nucleotide polymorphism: 

208, 210 

singletons: 100,103,131, 228, 232, 266 
percent singletons: 58,116,117,119 
Smith Wilson evenness: 57, 116,117, 
121, 241,242 
smoothing: 157,160,165 
SNPs (see single nucleotide 
polymorphism) 

Sorenson index: 68, 69, 70-75, 82, 
84,90 

conditional Sorenson index: 68, 
70-71 

Sorenson-Dice index: 68 
SPADE program: 82 
SPAR (see species area relationship) 
spatial analysis of macroecological 
data: 170 

spatial pattern in relative abundance: 
123 

spatial scale: 152,156,238,239,253, 
259,261 

continental/global scale: 253 
sub-regions: 253 
spatial structure: 156,160 
spatial regression: 161, 167-169 
spatial variation: 66 
Spearman correlation: 162 
species: 55 

cryptic species: 208, 209, 212,213 
invasive species: 252,254, 255 
shared species: 68,70-76,82 
species abundance: 197,198,206, 
207,239,243,247,249, 250 
species abundance analysis: 
132-135,137-140 

species abundance curve: 229, 230, 
232,246 

species abundance data: 195,197 
species abundance distribution: 23, 
31, 41,49-51, 53,57,59,62,63, 

73, 78, 88, 90, 93,94,97,101,103, 
105,106,114,122,123,130,131, 
134,137,138,139,148, 149, 234, 
243,247,2 48, 261,292,293 
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species: ( cont .) 

species abundance model: 124, 

125,132,137,138,140 
species accumulation curve: 21-23, 
42, 43, 45, 49, 62, 280, 283 
species area curve: 87, 89,101,151, 
276, 277,287 

species area relationship: 141, 262, 
276,277,282, 287 
species composition: 66 
species density: 5,6,11,43-45, 67, 
105,276, 277, 282, 286, 287 
species diversity: 208,213,217, 

237 

species evenness: 26,27, 29,57, 58, 
60,61,181,188,194, 199 ,246 
species frequencies: 73, 75, 82, 83, 
199 

species richness: 2,3,5, 6,15,19, 
21,23,26, 27,31,32,39-54, 56, 
59-62, 72, 79-81, 83, 84, 87-89, 

98, 100, 108, 112,122,154, 157, 
167,178,181,183,189,191,194, 
199, 200, 228,233,244, 253,254, 
259,292 

species-time-area-relationship: 89, 
123 

species time curve: 86 -88, 89,94 
species distinctness: 
originality of species within a set: 
197, 198, 205 

OSS (see originality of species within 
a set) 

PE (see pendant edge) 
pendant edge: 197, 198, 205, 207 
SEH (see species evolutionary history) 
SO (see species originality) 
species evolutionary history: 197, 

198. 205.207 

species originality: 197, 198, 205 
taxonomic distinctness: 197, 198, 

205.207 

TD(see taxonomic distinctness) 
sphenodonts: 265 
tuatara: 265 
STAR (see 

species-time-area-relationship) 
standard deviation: 16, 17, 111, 120, 
126,129,181,186,200, 204,232 
standard error: 17 
standardization: 23,181,186 
statistical techniques: 23 
statistician: 12, 24 
stochastic model: 125 
demographic stochasticity: 125, 

140 

stochastic abundance model: 124 


stochastic community dynamic 
model: 125 
stochastic error: 126 
stochastic sampling: 125 
stochastic sampling variation: 133 
stochastic simulation: 126,127, 

130,137,140 

stochastic simulation error: 127 
stochastic variation: 123,126,130, 
133,137 

STR (see microsatellite DNA loci) 
stratified random sample: 17, 21, 25, 
28,32 

stratigraphic range: 265, 267, 268, 

271 

survey: 11,12,14,15,17,18,23,24 
design: 12,14, 23,24 
initial survey: 18 
objective: 12, 21, 24 
pilot survey: 16,18, 24 
survivorship curve: 266-269 
backwards survivorship curve: 
267, 269 

forwards survivorship curve: 269 
Swingle, Homer S.: 11 
synoptic database: 265 
synoptic range: 268,269 
synthetic framework: 291 
systematic sample: 17, 21 

target population: 13,14 
taxonomic: 3,5, 39,46,48, 86,92,105, 
194,195, 291 

taxonomic distinctness: 197, 198, 

241, 243 

Taxus baccatus (see yew tree) 
temporal change: 85,86 
directional change: 85 
non-directional change: 85, 86 
temporal duration: 267 
temporal length: 266 
temporal patterns: 85, 86,93 
temporal patterns in relative 
abundance: 123 
temporal scale: 86, 275 
ecological time: 86 
evolutionary time: 86 
sampling time: 86 

temporal turnover: 31, 85, 88-90, 94, 
101 

theoretical ecology: 3. 
time curve: 4,6 
timing: 21, 23 

trait diversity: 2, 5,175,177,178, 379, 
184,188,189,193 
inter-specific trait: 184, 213-215, 
262,263 


intra-specific trait: 184,185, 
213-215,262, 263 
traits: 194 

binomial: 184,187, 192 
circular: 187 
continuous: 185 ,192 
convergence: 194 
evenness: 179 ,181 
functional: 175,185 
fuzzy: 184,187 
individual: 194 
mean: 180,184 
nominal: 184,187 ,192 
ordinal: 177,184,187 ,192 
random: 188 ,189 
trait dispersion: 175,177,183 
trait diversity: see trait diversity 
trait richness: 179, 189,193 
transect data: 156 
trawl 15,17 
trees: 80 

trend surfaces: 158,159 
global surfaces: 158,159 
trilobites: 265 

Cambrian trilobites: 267,275 
Ordovician trilobites: 275 
trophic level: 185,190, 222 
truncation: 125,128,271, 273, 

281 

UK Breeding Bird Survey: 25,32 
UK's Butterfly Monitoring Scheme: 
32 

unbiased: 52,113 
uncertainty: 36 
unimodal: 159, 240 
United Nations International Year of 
Biodiversity (IYB): 1 
univariate metrics: 237,239,2 40 ,250 
species evenness metrics: 239, 240 
species diversity metrics: 239 ,240 
species richness metrics: 239, 240 
urbanization: 254 
US Department of Defense: 276 
US Endangered Species Act: 211 
US National Park Service: 276 

variability: 16, 17,21,23,42,47 
variance: xvi, 42,43,45-47,49,52, 56, 
57,59-61,63, 64, 72, 73,131,159, 
160,186,202,205 
conditional: 42, 43, 45, 47,52 
environmental/spatial: 66,169 
neutral variation: 66 
percent (%) variance: 132,169 
residual variance: 131,138 
taxa variance: 184 
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unconditional variance: 42,45-47, 
52 

unconditional variance estimator: 
46,47 

unexplained: 169 

variance-based functional diversity: 
184 

variation: 

adaptive variation: 217, 218 
genetic variation: 208,216, 

217 

inter-annual variation: 258 
macro scale variation: 253 
variogram: 158,164 ,170 


cross variogram: 157 
vector: 228, 269 

Virginia Natural Heritage Program: 
278 

VNHP (see Virginia Natural Heritage 
Program) 
vulnerability: 2 

Wang/Solow and Everson approach: 
272, 273 

Weibull distribution: 112, 116,120 
Wilcoxon test: 129 
Wisconsin double standardization: 
249 


Whittaker plots: 

99,185 

WS2M program: 279 

yew tree: 100,101 
ZSM (see zero-sum multinomial) 
zero-sum multinomial: 110,112,118, 
119, 120,121 

Zipf-Mandelbrot distribution: 
111,113-115,118,120, 

122, 271 

z-transform quantitative trait data: 
186,187 



